GENSET.019A PATENT 

EXTENDED cDNAs 
Related Applications 
The present application claims priority from United States Provisional Patent 
Application Serial No. 60/069,957, filed December 17, 1997, United States Provisional 
Patent Application Serial No. 60/074,121, filed February 9, 1998, United States 
Provisional Patent Application Serial No. 60/081,563, filed April 13, 1998, United States 
Provisional Patent Application Serial No. 60/096,116, filed August 10, 1998, and United 
States Provisional Patent Application Serial No. 60/099,273, filed September 4, 1998, the 
disclosures of which are incorporated herein by reference in their entirety. 

Table I lists the SEQ ID Nos. of the extended cDNAs in the present application, 
the SEQ ID Nos. of the extended cDNAs in the provisional applications, and the identities 
of the provisional applications in which the extended cDNAs were disclosed. 

Background of the Invention 
The estimated 50,000-100,000 genes scattered along the human chromosomes 
offer tremendous promise for the understanding, diagnosis, and treatment of human 
diseases. In addition, probes capable of specifically hybridizing to loci distributed 
throughout the human genome find applications in the construction of high resolution 
chromosome maps and in the identification of individuals. 
In the past, the characterization of even a single human gene was a painstaking 
process, requiring years of effort. Recent developments in the areas of cloning vectors, 
DNA sequencing, and computer technology have merged to greatly accelerate the rate at 
which human genes can be isolated, sequenced, mapped, and characterized. Cloning 
vectors such as yeast artificial chromosomes (YACs) and bacterial artificial chromosomes 
(BACs) are able to accept DNA inserts ranging from 300 to 1000 kilobases (kb) or from 100 to 
400 kb in length, respectively, thereby facilitating the manipulation and ordering of DNA 
sequences distributed over great distances on the human chromosomes. Automated DNA 
sequencing machines permit the rapid sequencing of human genes. Bioinformatics 
software enables the comparison of nucleic acid and protein sequences, thereby assisting 
in the characterization of human gene products. 

Currently, two different approaches are being pursued for identifying and 
characterizing the genes distributed along the human genome. In one approach, large 
fragments of genomic DNA are isolated, cloned, and sequenced. Potential open reading 
frames in these genomic sequences are identified using bioinformatics software. 
However, this approach entails sequencing large stretches of human DNA which do not 
encode proteins in order to find the protein encoding sequences scattered throughout the 
genome. In addition to requiring extensive sequencing, the bioinformatics software may 
mischaracterize the genomic sequences obtained. Thus, the software may produce false 
positives in which non-coding DNA is mischaracterized as coding DNA or false negatives 
in which coding DNA is mislabeled as non-coding DNA. 

An alternative approach takes a more direct route to identifying and characterizing 
human genes. In this approach, complementary DNAs (cDNAs) are synthesized from 
isolated messenger RNAs (mRNAs) which encode human proteins. Using this approach, 
sequencing is only performed on DNA which is derived from protein coding portions of 
the genome. Often, only short stretches of the cDNAs are sequenced to obtain sequences 
called expressed sequence tags (ESTs). The ESTs may then be used to isolate or purify 
extended cDNAs which include sequences adjacent to the EST sequences. The extended 
cDNAs may contain all of the sequence of the EST which was used to obtain them or only 
a portion of the sequence of the EST which was used to obtain them. In addition, the 
extended cDNAs may contain the full coding sequence of the gene from which the EST 
was derived or, alternatively, the extended cDNAs may include portions of the coding 
sequence of the gene from which the EST was derived. It will be appreciated that there 
may be several extended cDNAs which include the EST sequence as a result of alternative 
splicing or the activity of alternative promoters. 

In the past, the short EST sequences which could be used to isolate or purify 
extended cDNAs were often obtained from oligo-dT primed cDNA libraries. 
Accordingly, they mainly corresponded to the 3' untranslated region of the mRNA. In 
part, the prevalence of EST sequences derived from the 3' end of the mRNA is a result of 
the fact that typical techniques for obtaining cDNAs are not well suited for isolating 
cDNA sequences derived from the 5' ends of mRNAs (Adams et al., Nature 377:174, 
1996; Hillier et al., Genome Res. 6:807-828, 1996). 
In addition, in those reported instances where longer cDNA sequences have been 
obtained, the reported sequences typically correspond to coding sequences and do not 
include the full 5' untranslated region of the mRNA from which the cDNA is derived. 
Such incomplete sequences may not include the first exon of the mRNA, particularly in 
situations where the first exon is short. Furthermore, they may not include some exons, 
often short ones, which are located upstream of splicing sites. Thus, there is a need to 
obtain sequences derived from the 5' ends of mRNAs which can be used to obtain 
extended cDNAs which may include the 5' sequences contained in the 5' ESTs. 

While many sequences derived from human chromosomes have practical 
applications, approaches based on the identification and characterization of those 
chromosomal sequences which encode a protein product are particularly relevant to 
diagnostic and therapeutic uses. Of the 50,000-100,000 protein coding genes, those genes 
encoding proteins which are secreted from the cell in which they are synthesized, as well 
as the secreted proteins themselves, are particularly valuable as potential therapeutic 
agents. Such proteins are often involved in cell to cell communication and may be 
responsible for producing a clinically relevant response in their target cells. 

In fact, several secretory proteins, including tissue plasminogen activator, G-CSF, 
GM-CSF, erythropoietin, human growth hormone, insulin, interferon-α, interferon-β, 
interferon-γ, and interleukin-2, are currently in clinical use. These proteins are used to 
treat a wide range of conditions, including acute myocardial infarction, acute ischemic 
stroke, anemia, diabetes, growth hormone deficiency, hepatitis, kidney carcinoma, 
chemotherapy induced neutropenia and multiple sclerosis. For these reasons, extended 
cDNAs encoding secreted proteins or portions thereof represent a particularly valuable 
source of therapeutic agents. Thus, there is a need for the identification and 
characterization of secreted proteins and the nucleic acids encoding them. 



In addition to being therapeutically useful themselves, secretory proteins include 
short peptides, called signal peptides, at their amino termini which direct their secretion. 
These signal peptides are encoded by the signal sequences located at the 5' ends of the 
coding sequences of genes encoding secreted proteins. Because these signal peptides will 
direct the extracellular secretion of any protein to which they are operably linked, the 
signal sequences may be exploited to direct the efficient secretion of any protein by 
operably linking the signal sequences to a gene encoding the protein for which secretion is 
desired. This may prove beneficial in gene therapy strategies in which it is desired to 
deliver a particular gene product to cells other than the cell in which it is produced. Signal 
sequences encoding signal peptides also find application in simplifying protein 
purification techniques. In such applications, the extracellular secretion of the desired 
protein greatly facilitates purification by reducing the number of undesired proteins from 
which the desired protein must be selected. Thus, there exists a need to identify and 
characterize the 5' portions of the genes for secretory proteins which encode signal 
peptides. 

Public information on the number of human genes for which the promoters and 
upstream regulatory regions have been identified and characterized is quite limited. In 
part, this may be due to the difficulty of isolating such regulatory sequences. Upstream 
regulatory sequences such as transcription factor binding sites are typically too short to be 
utilized as probes for isolating promoters from human genomic libraries. Recently, some 
approaches have been developed to isolate human promoters. One of them consists of 
making a CpG island library (Cross, S.H. et al., Purification of CpG Islands using a 
Methylated DNA Binding Column, Nature Genetics 6: 236-244 (1994)). The second 
consists of isolating human genomic DNA sequences containing SpeI binding sites by the 
use of SpeI binding protein (Mortlock et al., Genome Res. 6:327-335, 1996). Both of 
these approaches have their limits due to a lack of specificity or of comprehensiveness. 

5' ESTs and extended cDNAs obtainable therefrom may be used to efficiently 
identify and isolate upstream regulatory regions which control the location, developmental 
stage, rate, and quantity of protein synthesis, as well as the stability of the mRNA (Theil 
et al., BioFactors 4:87-93 (1993)). Once identified and characterized, these regulatory 
regions may be utilized in gene therapy or protein purification schemes to obtain the 
desired amount and locations of protein synthesis or to inhibit, reduce, or prevent the 
synthesis of undesirable gene products. 
In addition, ESTs containing the 5' ends of secretory protein genes or extended 
cDNAs which include sequences adjacent to the sequences of the ESTs may include 
sequences useful as probes for chromosome mapping and the identification of individuals. 
Thus, there is a need to identify and characterize the sequences upstream of the 5' coding 
sequences of genes encoding secretory proteins. 

Summary of the Invention 

The present invention relates to purified, isolated, or recombinant extended 
cDNAs which encode secreted proteins or fragments thereof. Preferably, the purified, 
isolated or recombinant cDNAs contain the entire open reading frame of their 
corresponding mRNAs, including a start codon and a stop codon. For example, the 
extended cDNAs may include nucleic acids encoding the signal peptide as well as the 
mature protein. Alternatively, the extended cDNAs may contain a fragment of the open 
reading frame. In some embodiments, the fragment may encode only the sequence of the 
mature protein. Alternatively, the fragment may encode only a portion of the mature 
protein. A further aspect of the present invention is a nucleic acid which encodes the 
signal peptide of a secreted protein. 

The present extended cDNAs were obtained using ESTs which include sequences 
derived from the authentic 5' ends of their corresponding mRNAs. As used herein the 
terms "EST" or "5' EST" refer to the short cDNAs which were used to obtain the extended 
cDNAs of the present invention. As used herein, the term "extended cDNA" refers to the 
cDNAs which include sequences adjacent to the 5' EST used to obtain them. The 
extended cDNAs may contain all or a portion of the sequence of the EST which was used 
to obtain them. The term "corresponding mRNA" refers to the mRNA which was the 
template for the cDNA synthesis which produced the 5' EST. As used herein, the term 
"purified" does not require absolute purity; rather, it is intended as a relative definition. 
Individual extended cDNA clones isolated from a cDNA library have been conventionally 
purified to electrophoretic homogeneity. The sequences obtained from these clones could 
not be obtained directly either from the library or from total human DNA. The extended 
cDNA clones are not naturally occurring as such, but rather are obtained via manipulation 
of a partially purified naturally occurring substance (messenger RNA). The conversion of 
mRNA into a cDNA library involves the creation of a synthetic substance (cDNA) and 
pure individual cDNA clones can be isolated from the synthetic library by clonal 
selection. Thus, creating a cDNA library from messenger RNA and subsequently 
isolating individual clones from that library results in an approximately 10^4-10^6 fold 
purification of the native message. Purification of starting material or natural material to 
at least one order of magnitude, preferably two or three orders, and more preferably four 
or five orders of magnitude is expressly contemplated. 

As used herein, the term "isolated" requires that the material be removed from its 
original environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally-occurring polynucleotide present in a living animal is not isolated, 
but the same polynucleotide, separated from some or all of the coexisting materials in the 
natural system, is isolated. 

As used herein, the term "recombinant" means that the extended cDNA is adjacent 
to "backbone" nucleic acid to which it is not adjacent in its natural environment. 
Additionally, to be "enriched" the extended cDNAs will represent 5% or more of the 
number of nucleic acid inserts in a population of nucleic acid backbone molecules. 
Backbone molecules according to the present invention include nucleic acids such as 
expression vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and 
other vectors or nucleic acids used to maintain or manipulate a nucleic acid insert of 
interest. Preferably, the enriched extended cDNAs represent 15% or more of the number 
of nucleic acid inserts in the population of recombinant backbone molecules. More 
preferably, the enriched extended cDNAs represent 50% or more of the number of nucleic 
acid inserts in the population of recombinant backbone molecules. In a highly preferred 
embodiment, the enriched extended cDNAs represent 90% or more of the number of 
nucleic acid inserts in the population of recombinant backbone molecules. "Stringent," 
"moderate," and "low" hybridization conditions are as defined in Example 29. 

Unless otherwise indicated, a "complementary" sequence is fully complementary. 
Thus, extended cDNAs encoding secreted polypeptides or fragments thereof which are 
present in cDNA libraries in which one or more extended cDNAs encoding secreted 
polypeptides or fragments thereof make up 5% or more of the number of nucleic acid 
inserts in the backbone molecules are "enriched recombinant extended cDNAs" as defined 
herein. Likewise, extended cDNAs encoding secreted polypeptides or fragments thereof 
which are in a population of plasmids in which one or more extended cDNAs of the 
present invention have been inserted such that they represent 5% or more of the number of 
inserts in the plasmid backbone are "enriched recombinant extended cDNAs" as defined 
herein. However, extended cDNAs encoding secreted polypeptides or fragments thereof 
which are in cDNA libraries in which the extended cDNAs encoding secreted 
polypeptides or fragments thereof constitute less than 5% of the number of nucleic acid 
inserts in the population of backbone molecules, such as libraries in which backbone 
molecules having a cDNA insert encoding a secreted polypeptide are extremely rare, are 
not "enriched recombinant extended cDNAs." 
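
The percentage thresholds recited above for "enriched" extended cDNAs (5%, 15%, 50%, and 90% of the nucleic acid inserts in a population of backbone molecules) can be illustrated with a short Python sketch. It is offered only as an illustration under stated assumptions: the function name, the tier labels, and the example counts are hypothetical and are not part of the present disclosure.

```python
# Illustrative sketch only. The helper name and tier labels are hypothetical;
# the percentages are the thresholds recited in the text above.
TIERS = [
    (90, "highly preferred (90% or more)"),
    (50, "more preferred (50% or more)"),
    (15, "preferred (15% or more)"),
    (5, "enriched (5% or more)"),
]


def enrichment_tier(extended_cdna_inserts: int, total_inserts: int) -> str:
    """Classify a population of backbone molecules by the share of its
    nucleic acid inserts that are extended cDNAs of interest."""
    if total_inserts <= 0 or not 0 <= extended_cdna_inserts <= total_inserts:
        raise ValueError("expected 0 <= extended_cdna_inserts <= total_inserts and total_inserts > 0")
    for percent, label in TIERS:
        # Integer arithmetic avoids floating-point rounding exactly at a threshold.
        if extended_cdna_inserts * 100 >= percent * total_inserts:
            return label
    return "not enriched (less than 5%)"
```

For example, under these assumptions a library in which 4 of 100 inserts are extended cDNAs encoding secreted polypeptides falls below the 5% threshold, whereas a library in which 5 of 100 inserts are such cDNAs meets it.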

In particular, the present invention relates to extended cDNAs which were derived 
from genes encoding secreted proteins. As used herein, a "secreted" protein is one which, 
when expressed in a suitable host cell, is transported across or through a membrane, 
including transport as a result of signal peptides in its amino acid sequence. "Secreted" 
proteins include without limitation proteins secreted wholly (e.g. soluble proteins), or 
partially (e.g. receptors) from the cell in which they are expressed. "Secreted" proteins 
also include without limitation proteins which are transported across the membrane of the 
endoplasmic reticulum. 

Extended cDNAs encoding secreted proteins may include nucleic acid sequences, 
called signal sequences, which encode signal peptides which direct the extracellular 
secretion of the proteins encoded by the extended cDNAs. Generally, the signal peptides 
are located at the amino termini of secreted proteins. 



Secreted proteins are translated by ribosomes associated with the "rough" 
endoplasmic reticulum. Generally, secreted proteins are co-translationally transferred to 
the membrane of the endoplasmic reticulum. Association of the ribosome with the 
endoplasmic reticulum during translation of secreted proteins is mediated by the signal 
peptide. The signal peptide is typically cleaved following its co-translational entry into 
the endoplasmic reticulum. After delivery to the endoplasmic reticulum, secreted proteins 
may proceed through the Golgi apparatus. In the Golgi apparatus, the proteins may 
undergo post-translational modification before entering secretory vesicles which transport 
them across the cell membrane. 

The extended cDNAs of the present invention have several important 
applications. For example, they may be used to express the entire secreted protein which 
they encode. Alternatively, they may be used to express portions of the secreted protein. 
The portions may comprise the signal peptides encoded by the extended cDNAs or the 
mature proteins encoded by the extended cDNAs (i.e. the proteins generated when the 
signal peptide is cleaved off). The portions may also comprise polypeptides having at 
least 10 consecutive amino acids encoded by the extended cDNAs. Alternatively, the 
portions may comprise at least 15 consecutive amino acids encoded by the extended 
cDNAs. In some embodiments, the portions may comprise at least 25 consecutive amino 
acids encoded by the extended cDNAs. In other embodiments, the portions may comprise 
at least 40 amino acids encoded by the extended cDNAs. 

Antibodies which specifically recognize the entire secreted proteins encoded by 
the extended cDNAs or fragments thereof having at least 10 consecutive amino acids, at 
least 15 consecutive amino acids, at least 25 consecutive amino acids, or at least 40 
consecutive amino acids may also be obtained as described below. Antibodies which 
specifically recognize the mature protein generated when the signal peptide is cleaved may 
also be obtained as described below. Similarly, antibodies which specifically recognize 
the signal peptides encoded by the extended cDNAs may also be obtained. 

In some embodiments, the extended cDNAs include the signal sequence. In other 
embodiments, the extended cDNAs may include the full coding sequence for the mature 
protein (i.e. the protein generated when the signal peptide is cleaved off). In addition, 
the extended cDNAs may include regulatory regions upstream of the translation start site 
or downstream of the stop codon which control the amount, location, or developmental 
stage of gene expression. As discussed above, secreted proteins are therapeutically 
important. Thus, the proteins expressed from the cDNAs may be useful in treating or 
controlling a variety of human conditions. The extended cDNAs may also be used to 
obtain the corresponding genomic DNA. The term "corresponding genomic DNA" refers 
to the genomic DNA which encodes mRNA which includes the sequence of one of the 
strands of the extended cDNA in which thymidine residues in the sequence of the 
extended cDNA are replaced by uracil residues in the mRNA. 
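
The relationship between a strand of an extended cDNA and the mRNA sequence described in the preceding definition (thymidine replaced by uracil) can be sketched in a few lines of Python. This is a minimal illustration, assuming the cDNA strand is given 5' to 3' as a string over the letters A, C, G, and T; the function name and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def cdna_strand_to_mrna(cdna_strand: str) -> str:
    """Return the mRNA sequence having the same sequence as the given cDNA
    strand, with each thymidine (T) replaced by uracil (U)."""
    strand = cdna_strand.upper()
    unexpected = set(strand) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters in cDNA strand: {sorted(unexpected)}")
    return strand.replace("T", "U")


# Hypothetical toy sequence, for illustration only.
example_mrna = cdna_strand_to_mrna("ATGGCTTAA")
```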

The extended cDNAs or genomic DNAs obtained therefrom may be used in 
forensic procedures to identify individuals or in diagnostic procedures to identify 
individuals having genetic diseases resulting from abnormal expression of the genes 
corresponding to the extended cDNAs. In addition, the present invention is useful for 
constructing a high resolution map of the human chromosomes. 

The present invention also relates to secretion vectors capable of directing the 
secretion of a protein of interest. Such vectors may be used in gene therapy strategies in 
which it is desired to produce a gene product in one cell which is to be delivered to 
another location in the body. Secretion vectors may also facilitate the purification of 
desired proteins. 

The present invention also relates to expression vectors capable of directing the 
expression of an inserted gene in a desired spatial or temporal manner or at a desired level. 
Such vectors may include sequences upstream of the extended cDNAs such as promoters 
or upstream regulatory sequences. 
In addition, the present invention may also be used for gene therapy to control or 
treat genetic diseases. Signal peptides may also be fused to heterologous proteins to direct 
their extracellular secretion. 

One embodiment of the present invention is a purified or isolated nucleic acid 
comprising the sequence of one of SEQ ID NOs: 40-140 and 242-377 or a sequence 
complementary thereto. In one aspect of this embodiment, the nucleic acid is 
recombinant. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
comprising at least 10 consecutive bases of the sequence of one of SEQ ID NOs: 40-140 
and 242-377 or one of the sequences complementary thereto. In one aspect of this 
embodiment, the nucleic acid comprises at least 15, 25, 30, 40, 50, 75, or 100 consecutive 
bases of one of the sequences of SEQ ID NOs: 40-140 and 242-377 or one of the 
sequences complementary thereto. The nucleic acid may be a recombinant nucleic acid. 
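
The criterion of "at least 10 consecutive bases" of a reference sequence or of its complementary sequence can be illustrated with the following Python sketch. It is a simplified illustration under stated assumptions: sequences are plain strings over A, C, G, and T written 5' to 3', the complementary sequence is taken to be the reverse complement, and the function names and toy sequences are hypothetical rather than drawn from SEQ ID NOs: 40-140 and 242-377.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the fully complementary strand, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def shares_consecutive_bases(candidate: str, reference: str, k: int = 10) -> bool:
    """True if the candidate contains at least k consecutive bases of the
    reference sequence or of the sequence complementary to it."""
    if k <= 0:
        raise ValueError("k must be positive")
    candidate = candidate.upper()
    reference = reference.upper()
    # Collect every k-base window of the reference and of its complement.
    windows = set()
    for strand in (reference, reverse_complement(reference)):
        for i in range(len(strand) - k + 1):
            windows.add(strand[i:i + k])
    # Scan the candidate for any of those windows.
    return any(candidate[i:i + k] in windows for i in range(len(candidate) - k + 1))
```

The same function can be called with k set to 15, 25, 30, 40, 50, 75, or 100 to mirror the longer spans recited in this embodiment.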

Another embodiment of the present invention is a purified or isolated nucleic acid 
of at least 15 bases capable of hybridizing under stringent conditions to the sequence of 
one of SEQ ID NOs: 40-140 and 242-377 or a sequence complementary to one of the 
sequences of SEQ ID NOs: 40-140 and 242-377. In one aspect of this embodiment, the 
nucleic acid is recombinant. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
comprising the full coding sequence of one of SEQ ID NOs: 40-140 and 242-377, 
wherein the full coding sequence optionally comprises the sequence encoding signal 
peptide as well as the sequence encoding mature protein. In a preferred embodiment, the 
isolated or purified nucleic acid comprises the full coding sequence of one of SEQ ID 
NOs: 40, 42-44, 46, 48, 49, 51, 53, 60, 62-72, 76-78, 80-83, 85-88, 90, 93, 94, 97, 
99-102, 104, 107-125, 127, 132, 135-138, 140 and 242-377 wherein the full coding 
sequence comprises the sequence encoding signal peptide and the sequence encoding 
mature protein. In one aspect of this embodiment, the nucleic acid is recombinant. 

A further embodiment of the present invention is a purified or isolated nucleic acid 
comprising the nucleotides of one of SEQ ID NOs: 40-140 and 242-377 which encode a 
mature protein. In a preferred embodiment, the purified or isolated nucleic acid comprises 
the nucleotides of one of SEQ ID NOs: 40-44, 46, 48, 49, 51-53, 55, 56, 58-72, 75-78, 
80-88, 90, 93, 94, 97, 99-125, 127, 132, 133, 135-138, 140, and 242-377 which encode 
a mature protein. In one aspect of this embodiment, the nucleic acid is recombinant. 

Yet another embodiment of the present invention is a purified or isolated nucleic 
acid comprising the nucleotides of one of SEQ ID NOs: 40-140 and 242-377 which 
encode the signal peptide. In a preferred embodiment, the purified or isolated nucleic acid 
comprises the nucleotides of SEQ ID NOs: 40, 42-46, 48, 49, 51, 53, 57, 60, 62-73, 76- 
78, 80-83, 85-88, 90, 93-95, 97, 99-102, 104, 107-125, 127, 128, 130, 132, 134-140 and 
242-377 which encode the signal peptide. In one aspect of this embodiment, the nucleic 
acid is recombinant. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
encoding a polypeptide having the sequence of one of the sequences of SEQ ID NOs: 141- 
241 and 378-513. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
encoding a polypeptide having the sequence of a mature protein included in one of the 
sequences of SEQ ID NOs: 141-241 and 378-513. In a preferred embodiment, the 
purified or isolated nucleic acid encodes a polypeptide having the sequence of a mature 
protein included in one of the sequences of SEQ ID NOs: 141-145, 147, 149, 150, 152- 
154, 156, 157, 159-172, 176-179, 181-189, 191, 194, 195, 198, 200-226, 228, 233, 234, 
236-239, 241 and 378-513. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
encoding a polypeptide having the sequence of a signal peptide included in one of the 
sequences of SEQ ID NOs: 141-241 and 378-513. In a preferred embodiment, the 
purified or isolated nucleic acid encodes a polypeptide having the sequence of a signal 
peptide included in one of the sequences of SEQ ID NOs: 141, 143-147, 149, 150, 152, 
154, 158, 161, 163-174, 177-179, 181-184, 186-189, 191, 194-196, 198, 200-203, 205, 
208-226, 228, 229, 231, 233, 235-241, and 378-513. 

Yet another embodiment of the present invention is a purified or isolated protein 
comprising the sequence of one of SEQ ID NOs: 141-241 and 378-513. 

Another embodiment of the present invention is a purified or isolated polypeptide 
comprising at least 10 consecutive amino acids of one of the sequences of SEQ ID NOs: 
141-241 and 378-513. In one aspect of this embodiment, the purified or isolated 
polypeptide comprises at least 15, 20, 25, 35, 50, 75, 100, 150 or 200 consecutive amino 
acids of one of the sequences of SEQ ID NOs: 141-241 and 378-513. In still another 
aspect, the purified or isolated polypeptide comprises at least 25 consecutive amino acids 
of one of the sequences of SEQ ID NOs: 141-241 and 378-513. 

Another embodiment of the present invention is an isolated or purified polypeptide 
comprising a signal peptide of one of the polypeptides of SEQ ID NOs: 141-241 and 378- 
513. In a preferred embodiment, the isolated or purified polypeptide comprises a signal 
peptide of one of the polypeptides of SEQ ID NOs: 141, 143-147, 149, 150, 152, 154, 
158, 161, 163-174, 177-179, 181-184, 186-189, 191, 194-196, 198, 200-203, 205, 208- 
226, 228, 229, 231, 233, 235-241, and 378-513. 

Yet another embodiment of the present invention is an isolated or purified 
polypeptide comprising a mature protein of one of the polypeptides of SEQ ID NOs: 141- 
241 and 378-513. In a preferred embodiment, the isolated or purified polypeptide 
comprises a mature protein of one of the polypeptides of SEQ ID NOs: 141-145, 147, 
149, 150, 152-154, 156, 157, 159-172, 176-179, 181-189, 191, 194, 195, 198, 200-226, 
228, 233, 234, 236-239, 241 and 378-513. 

A further embodiment of the present invention is a method of making a protein 
comprising one of the sequences of SEQ ID NOs: 141-241 and 378-513, comprising the 
steps of obtaining a cDNA comprising one of the sequences of SEQ ID NOs: 
40-140 and 242-377, inserting the cDNA in an expression vector such that the cDNA is 
operably linked to a promoter, and introducing the expression vector into a host cell 
whereby the host cell produces the protein encoded by said cDNA. In one aspect of this 
embodiment, the method further comprises the step of isolating the protein. 

Another embodiment of the present invention is a protein obtainable by the 
method described in the preceding paragraph. 

Another embodiment of the present invention is a method of making a protein 
comprising the amino acid sequence of the mature protein contained in one of the 
sequences of SEQ ID NOs: 141-241 and 378-513, comprising the steps of obtaining a 
cDNA comprising the nucleotides of one of SEQ ID NOs: 40-140 
and 242-377 which encode the mature protein, inserting the cDNA in an expression 
vector such that the cDNA is operably linked to a promoter, and introducing the 
expression vector into a host cell whereby the host cell produces the mature protein 
encoded by the cDNA. In one aspect of this embodiment, the method further comprises 
the step of isolating the protein. 

Another embodiment of the present invention is a mature protein obtainable by the 
method described in the preceding paragraph. 

In a preferred embodiment, the above method comprises a method of making a 
protein comprising the amino acid sequence of the mature protein contained in one of the 
sequences of SEQ ID NOs: 141-145, 147, 149, 150, 152-154, 156, 157, 159-172, 
176-179, 181-189, 191, 194, 195, 198, 200-226, 228, 233, 234, 236-239, 241 and 378-513, 
comprising the steps of obtaining a cDNA comprising the nucleotides of one of 
SEQ ID NOs: 40-44, 46, 48, 49, 51-53, 55, 56, 58-72, 75-78, 80-88, 90, 93, 
94, 97, 99-125, 127, 132, 133, 135-138, 140, and 242-377 which encode the mature 
protein, inserting the cDNA in an expression vector such that the cDNA is operably linked 
to a promoter, and introducing the expression vector into a host cell whereby the host cell 
produces the mature protein encoded by the cDNA. In one aspect of this embodiment, the 
method further comprises the step of isolating the protein. 

Another embodiment of the present invention is a host cell containing the purified 
or isolated nucleic acids comprising the sequence of one of SEQ ID NOs: 40-140 and 
242-377 or a sequence complementary thereto described herein. 

Another embodiment of the present invention is a host cell containing the purified 
or isolated nucleic acids comprising the full coding sequences of one of SEQ ID NOs: 40- 
140 and 242-377, wherein the full coding sequence comprises the sequence encoding 
signal peptide and the sequence encoding mature protein described herein. 

Another embodiment of the present invention is a host cell containing the purified 
or isolated nucleic acids comprising the nucleotides of one of SEQ ID NOs: 40-140 and 
242-377 which encode a mature protein which are described herein. Preferably, the host 
cell contains the purified or isolated nucleic acids comprising the nucleotides of one of 
SEQ ID NOs: 40-44, 46, 48, 49, 51-53, 55, 56, 58-72, 75-78, 80-88, 90, 93, 94, 97, 
99-125, 127, 132, 133, 135-138, 140, and 242-377 which encode a mature protein. 

Another embodiment of the present invention is a host cell containing the purified 
or isolated nucleic acids comprising the nucleotides of one of SEQ ID NOs: 40-140 and 
242-377 which encode the signal peptide which are described herein. Preferably, the host 
cell contains the purified or isolated nucleic acids comprising the nucleotides of one of 
SEQ ID NOs: 40, 42-46, 48, 49, 51, 53, 57, 60, 62-73, 76-78, 80-83, 85-88, 90, 93-95, 
97, 99-102, 104, 107-125, 127, 128, 130, 132, 134-140 and 242-377 which encode the 
signal peptide. 

Another embodiment of the present invention is a purified or isolated antibody 
capable of specifically binding to a protein having the sequence of one of SEQ ID NOs: 
141-241 and 378-513. In one aspect of this embodiment, the antibody is capable of 
binding to a polypeptide comprising at least 10 consecutive amino acids of the sequence 
of one of SEQ ID NOs: 141-241 and 378-513. 

Another embodiment of the present invention is an array of cDNAs or fragments 
thereof of at least 15 nucleotides in length which includes at least one of the sequences of 
SEQ ID NOs: 40-140 and 242-377, or one of the sequences complementary to the 
sequences of SEQ ID NOs: 40-140 and 242-377, or a fragment thereof of at least 15 
consecutive nucleotides. In one aspect of this embodiment, the array includes at least two 
of the sequences of SEQ ID NOs: 40-140 and 242-377, the sequences complementary to 
the sequences of SEQ ID NOs: 40-140 and 242-377, or fragments thereof of at least 15 
consecutive nucleotides. In another aspect of this embodiment, the array includes at least 
five of the sequences of SEQ ID NOs: 40-140 and 242-377, the sequences complementary 
to the sequences of SEQ ID NOs: 40-140 and 242-377, or fragments thereof of at least 15 
consecutive nucleotides. 

A further embodiment of the invention encompasses purified polynucleotides 
comprising an insert from a clone deposited in a deposit having an accession number 
selected from the group consisting of the accession numbers listed in Table VI or a 
fragment thereof comprising a contiguous span of at least 8, 10, 12, 15, 20, 25, 40, 60, 
100, or 200 nucleotides of said insert. An additional embodiment of the invention 
encompasses purified polypeptides which comprise, consist of, or consist essentially of an 
amino acid sequence encoded by the insert from a clone deposited in a deposit having an 
accession number selected from the group consisting of the accession numbers listed in 
Table VI, as well as polypeptides which comprise a fragment of said amino acid sequence 
consisting of a signal peptide, a mature protein, or a contiguous span of at least 5, 8, 10, 
12, 15, 20, 25, 40, 60, 100, or 200 amino acids encoded by said insert. 

An additional embodiment of the invention encompasses purified polypeptides 
which comprise a contiguous span of at least 5, 8, 10, 12, 15, 20, 25, 40, 60, 100, or 200 
amino acids of SEQ ID NOs: 158, 174, 175, 196, 226, 231, and 232, wherein said contiguous 
span comprises at least one of the amino acid positions which was not shown to be 
identical to a public sequence in any of Figures 11 to 15. Also encompassed by the 
invention are purified polynucleotides encoding said polypeptides. 

Brief Description of the Drawings 
Figure 1 is a summary of a procedure for obtaining cDNAs which have been 
selected to include the 5' ends of the mRNAs from which they are derived. 

Figure 2 is an analysis of the 43 amino terminal amino acids of all human 
SwissProt proteins to determine the frequency of false positives and false negatives using 
the techniques for signal peptide identification described herein. 

Figure 3 shows the distribution of von Heijne scores for 5' ESTs in each of the 
categories described herein and the probability that these 5' ESTs encode a signal peptide. 

Figure 4 shows the distribution of 5' ESTs in each category and the number of 5' 
ESTs in each category having a given minimum von Heijne's score. 

Figure 5 shows the tissues from which the mRNAs corresponding to the 5' ESTs 
in each of the categories described herein were obtained. 

Figure 6 illustrates a method for obtaining extended cDNAs. 

Figure 7 is a map of pED6dpc2. 

Figure 8 provides a schematic description of the promoters isolated and the way 
they are assembled with the corresponding 5' tags. 

Figure 9 describes the transcription factor binding sites present in each of these 
promoters. 

Figure 10 is an alignment of the protein of SEQ ID NO: 217 with the human 
protein TFAR19 that may play a role in apoptosis (Genbank accession number 
AF014955, SEQ ID NO: 516). 

Figure 11 is an alignment of the proteins of SEQ ID NOs: 174, 175 and 232 with 
a human secreted protein (Genseq accession number W36955, SEQ ID NO: 517). 

Figure 12 is an alignment of the protein of SEQ ID NO: 231 with the human 
E25 protein (Genbank accession number AF038953, SEQ ID NO: 515). 

Figure 13 is an alignment of the protein of SEQ ID NO: 196 with the human 
seven-transmembrane protein (Genbank accession number Y11395, SEQ ID NO: 518). 

Figure 14 is an alignment of the protein of SEQ ID NO: 158 with the murine 
subunit 7a of the COP9 complex (Genbank accession number AF071316, SEQ ID NO: 519). 

Figure 15 is an alignment of the protein of SEQ ID NO: 226 with the bovine 
subunit B14.5B of the NADH-ubiquinone oxidoreductase complex (Arizmendi et al., 
FEBS Lett. 313:80-84 (1992) and Swissprot accession number Q02827, SEQ ID NO: 514). 

Detailed Description of the Preferred Embodiment 
I. Obtaining 5' ESTs 

The present extended cDNAs were obtained using 5' ESTs which were isolated as 
described below. 

A. Chemical Methods for Obtaining mRNAs having Intact 5' Ends 
In order to obtain the 5' ESTs used to obtain the extended cDNAs of the present 
invention, mRNAs having intact 5' ends must be obtained. Currently, there are two 
approaches for obtaining such mRNAs. One of these approaches is a chemical 
modification method involving derivatization of the 5' ends of the mRNAs and selection 
of the derivatized mRNAs. The 5' ends of eucaryotic mRNAs possess a structure referred 
to as a "cap" which comprises a guanosine methylated at the 7 position. The cap is joined 
to the first transcribed base of the mRNA by a 5', 5'-triphosphate bond. In some instances, 
the 5' guanosine is methylated in both the 2 and 7 positions. Rarely, the 5' guanosine is 
trimethylated at the 2, 2 and 7 positions. In the chemical method for obtaining mRNAs 
having intact 5' ends, the 5' cap is specifically derivatized and coupled to a reactive group 
on an immobilizing substrate. This specific derivatization is based on the fact that only 
the ribose linked to the methylated guanosine at the 5' end of the mRNA and the ribose 
linked to the base at the 3' terminus of the mRNA, possess 2', 3'-cis diols. Optionally, 
where the 3' terminal ribose has a 2', 3'-cis diol, the 2', 3'-cis diol at the 3' end may be 
chemically modified, substituted, converted, or eliminated, leaving only the ribose linked 
to the methylated guanosine at the 5' end of the mRNA with a 2', 3'-cis diol. A variety of 
techniques are available for eliminating the 2', 3'-cis diol on the 3' terminal ribose. For 
example, controlled alkaline hydrolysis may be used to generate mRNA fragments in 
which the 3' terminal ribose is a 3'-phosphate, 2'-phosphate or (2', 3')-cyclophosphate. 
Thereafter, the fragment which includes the original 3' ribose may be eliminated from the 
mixture through chromatography on an oligo-dT column. Alternatively, a base which 
lacks the 2', 3'-cis diol may be added to the 3' end of the mRNA using an RNA ligase 
such as T4 RNA ligase. Example 1 below describes a method for ligation of pCp to the 3' 
end of messenger RNA. 
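
The selection logic described above can be sketched in a few lines of Python. This is only an illustrative model, not part of the disclosed method: the class `Rna` and the functions `periodate_reactive_ends` and `block_3prime` are hypothetical names, and each RNA is reduced to two flags (whether it carries a cap, and whether its 3' terminal ribose still has a free 2', 3'-cis diol).

```python
from dataclasses import dataclass


@dataclass
class Rna:
    capped: bool            # 5' end carries the methylated guanosine cap
    free_3prime_diol: bool  # 3' terminal ribose still has a 2', 3'-cis diol


def periodate_reactive_ends(rna):
    """List the ends whose ribose carries a 2', 3'-cis diol (oxidizable ends)."""
    ends = []
    if rna.capped:
        ends.append("5'")
    if rna.free_3prime_diol:
        ends.append("3'")
    return ends


def block_3prime(rna):
    """Model pCp ligation or alkaline hydrolysis: the 3' cis diol is eliminated."""
    return Rna(capped=rna.capped, free_3prime_diol=False)


full_length = Rna(capped=True, free_3prime_diol=True)
truncated = Rna(capped=False, free_3prime_diol=True)

for name, rna in (("full length", full_length), ("truncated", truncated)):
    print(name, periodate_reactive_ends(rna), periodate_reactive_ends(block_3prime(rna)))
```

In this simplified model, once the 3' diol is blocked, only a capped RNA retains an oxidizable end, which is the basis for the 5' specific derivatization.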

EXAMPLE 1 

Ligation of the Nucleoside Diphosphate pCp to the 3' End of Messenger RNA 

1 µg of RNA was incubated in a final reaction medium of 10 µl in the presence of 
5 U of T4 phage RNA ligase in the buffer provided by the manufacturer (Gibco-BRL), 40 
U of the RNase inhibitor RNasin (Promega) and 2 µl of ³²pCp (Amersham #PB 10208). 
The incubation was performed at 37°C for 2 hours or overnight at 7-8°C. 

Following modification or elimination of the 2', 3'-cis diol at the 3' ribose, the 2', 
3'-cis diol present at the 5' end of the mRNA may be oxidized using reagents such as 
NaBH4, NaBH3CN, or sodium periodate, thereby converting the 2', 3'-cis diol to a 
dialdehyde. Example 2 describes the oxidation of the 2', 3'-cis diol at the 5' end of the 
mRNA with sodium periodate. 

EXAMPLE 2 

Oxidation of 2', 3'-cis diol at the 5' End of the mRNA 
0.1 OD unit of either a capped oligoribonucleotide of 47 nucleotides (including the 
cap) or an uncapped oligoribonucleotide of 46 nucleotides was treated as follows. The 
oligoribonucleotides were produced by in vitro transcription using the transcription kit 
"AmpliScribe T7" (Epicentre Technologies). As indicated below, the DNA template for 
the RNA transcript contained a single cytosine. To synthesize the uncapped RNA, all four 
NTPs were included in the in vitro transcription reaction. To obtain the capped RNA, 
GTP was replaced by an analogue of the cap, m7G(5')ppp(5')G. This compound, 
recognized by polymerase, was incorporated into the 5' end of the nascent transcript 
during the step of initiation of transcription but was not capable of incorporation during 
the extension step. Consequently, the resulting RNA contained a cap at its 5' end. The 
sequences of the oligoribonucleotides produced by the in vitro transcription reaction were: 
+Cap: 

5'm7GpppGCAUCCUACUCCCAUCCAAUUCCACCCUAACUCCUCCCAUCUCCAC-3' (SEQ ID NO:1) 

-Cap: 

5'-pppGCAUCCUACUCCCAUCCAAUUCCACCCUAACUCCUCCCAUCUCCAC-3' (SEQ ID NO:2) 
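
As a quick consistency check on the two transcripts, the following sketch counts the nucleotides and the guanosines in the sequence given above. The variable names are illustrative only; the sequence is copied from the text without the cap or the 5' triphosphate.

```python
# Transcribed portion shared by the capped and uncapped transcripts
# (cap and 5' triphosphate omitted).
TRANSCRIPT = "GCAUCCUACUCCCAUCCAAUUCCACCCUAACUCCUCCCAUCUCCAC"

uncapped_length = len(TRANSCRIPT)    # nucleotides in the uncapped RNA
capped_length = len(TRANSCRIPT) + 1  # the cap guanosine adds one nucleotide
guanosines = TRANSCRIPT.count("G")   # a single C in the template gives a single G

print(uncapped_length, capped_length, guanosines)
```

The script prints `46 47 1`, matching the stated lengths. Because the only guanosine is the first transcribed base, replacing GTP with the cap analogue does not prevent extension of the transcript.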

The oligoribonucleotides were dissolved in 9 µl of acetate buffer (0.1 M sodium 
acetate, pH 5.2) and 3 µl of freshly prepared 0.1 M sodium periodate solution. The 
mixture was incubated for 1 hour in the dark at 4°C or room temperature. Thereafter, the 
reaction was stopped by adding 4 µl of 10% ethylene glycol. The product was ethanol 
precipitated, resuspended in 10 µl or more of water or appropriate buffer and dialyzed 
against water. 
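
The working concentrations implied by the volumes in Example 2 can be derived with simple dilution arithmetic. The sketch below assumes that the RNA contributes no volume of its own and that volumes are additive; the helper `diluted` is a hypothetical name used only for this illustration.

```python
def diluted(stock, stock_ul, total_ul):
    """Concentration after diluting stock_ul of a stock into total_ul."""
    return stock * stock_ul / total_ul


buffer_ul = 9.0     # 0.1 M sodium acetate, pH 5.2
periodate_ul = 3.0  # 0.1 M sodium periodate
glycol_ul = 4.0     # 10% ethylene glycol

reaction_ul = buffer_ul + periodate_ul
periodate_mM = 1000 * diluted(0.1, periodate_ul, reaction_ul)
acetate_mM = 1000 * diluted(0.1, buffer_ul, reaction_ul)

quenched_ul = reaction_ul + glycol_ul
glycol_percent = diluted(10.0, glycol_ul, quenched_ul)

print(round(periodate_mM, 3), round(acetate_mM, 3), round(glycol_percent, 3))
```

Under these assumptions the oxidation runs at about 25 mM periodate in 75 mM acetate, and the reaction is stopped at about 2.5% ethylene glycol.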

The resulting aldehyde groups may then be coupled to molecules having a reactive 
amine group, such as hydrazine, carbazide, thiocarbazide or semicarbazide groups, in 
order to facilitate enrichment of the 5' ends of the mRNAs. Molecules having reactive 
amine groups which are suitable for use in selecting mRNAs having intact 5' ends include 
avidin, proteins, antibodies, vitamins, ligands capable of specifically binding to receptor 
molecules, or oligonucleotides. Example 3 below describes the coupling of the resulting 
dialdehyde to biotin. 

EXAMPLE 3 

Coupling of the Dialdehyde with Biotin 

The oxidation product obtained in Example 2 was dissolved in 50 µl of sodium 
acetate at a pH of between 5 and 5.2 and 50 µl of a freshly prepared 0.02 M solution of 
biotin hydrazide in a methoxyethanol/water mixture (1:1) of formula: 

NH2-NH-C(=O)-(CH2)n-NH-C(=O)-[biotin] 

In the compound used in these experiments, n=5. However, it will be appreciated 
that other commercially available hydrazides may also be used, such as molecules of the 
formula above in which n varies from 0 to 5. 

The mixture was then incubated for 2 hours at 37°C. Following the incubation, the 
mixture was precipitated with ethanol and dialyzed against distilled water. 

Example 4 demonstrates the specificity of the biotinylation reaction. 

EXAMPLE 4 
Specificity of Biotinylation 

The specificity of the biotinylation for capped mRNAs was evaluated by gel 
electrophoresis of the following samples: 

Sample 1. The 46 nucleotide uncapped in vitro transcript prepared as in Example 
2 and labeled with ³²pCp as described in Example 1. 

Sample 2. The 46 nucleotide uncapped in vitro transcript prepared as in Example 
2, labeled with ³²pCp as described in Example 1, treated with the oxidation reaction of 
Example 2, and subjected to the biotinylation conditions of Example 3. 

Sample 3. The 47 nucleotide capped in vitro transcript prepared as in Example 2 
and labeled with ³²pCp as described in Example 1. 

Sample 4. The 47 nucleotide capped in vitro transcript prepared as in Example 2, 
labeled with ³²pCp as described in Example 1, treated with the oxidation reaction of 
Example 2, and subjected to the biotinylation conditions of Example 3. 

Samples 1 and 2 had identical migration rates, demonstrating that the uncapped 
RNAs were not oxidized and biotinylated. Sample 3 migrated more slowly than Samples 
1 and 2, while Sample 4 exhibited the slowest migration. The difference in migration of 
the RNAs in Samples 3 and 4 demonstrates that the capped RNAs were specifically 
biotinylated. 

In some cases, mRNAs having intact 5' ends may be enriched by binding the 
molecule containing a reactive amine group to a suitable solid phase substrate such as the 
inside of the vessel containing the mRNAs, magnetic beads, chromatography matrices, or 
nylon or nitrocellulose membranes. For example, where the molecule having a reactive 
amine group is biotin, the solid phase substrate may be coupled to avidin or streptavidin. 
Alternatively, where the molecule having the reactive amine group is an antibody or 
receptor ligand, the solid phase substrate may be coupled to the cognate antigen or 
receptor. Finally, where the molecule having a reactive amine group comprises an 
oligonucleotide, the solid phase substrate may comprise a complementary oligonucleotide. 

The mRNAs having intact 5' ends may be released from the solid phase following 
the enrichment procedure. For example, where the dialdehyde is coupled to biotin 
hydrazide and the solid phase comprises streptavidin, the mRNAs may be released from 
the solid phase by simply heating to 95 degrees Celsius in 2% SDS. In some methods, the 
molecule having a reactive amine group may also be cleaved from the mRNAs having 
intact 5' ends following enrichment. Example 5 describes the capture of biotinylated 
mRNAs with streptavidin coated beads and the release of the biotinylated mRNAs from 
the beads following enrichment. 

EXAMPLE 5 

Capture and Release of Biotinylated mRNAs Using Streptavidin Coated Beads 
The streptavidin-coated magnetic beads were prepared according to the 
manufacturer's instructions (CPG Inc., USA). The biotinylated mRNAs were added to a 
hybridization buffer (1.5 M NaCl, pH 5-6). After incubating for 30 minutes, the unbound 
and nonbiotinylated material was removed. The beads were washed several times in water 
with 1% SDS. The beads obtained were incubated for 15 minutes at 95°C in water 
containing 2% SDS. 

Example 6 demonstrates the efficiency with which biotinylated mRNAs were 
recovered from the streptavidin coated beads. 

EXAMPLE 6 
Efficiency of Recovery of Biotinylated mRNAs 
The efficiency of the recovery procedure was evaluated as follows. RNAs were 
labeled with ³²pCp, oxidized, biotinylated and bound to streptavidin coated beads as 
described above. Subsequently, the bound RNAs were incubated for 5, 15 or 30 minutes 
at 95°C in the presence of 2% SDS. 

The products of the reaction were analyzed by electrophoresis on 12% 
polyacrylamide gels under denaturing conditions (7 M urea). The gels were subjected to 
autoradiography. During this manipulation, the hydrazone bonds were not reduced. 

Increasing amounts of nucleic acids were recovered as incubation times in 2% 
SDS increased, demonstrating that biotinylated mRNAs were efficiently recovered. 

In an alternative method for obtaining mRNAs having intact 5' ends, an 
oligonucleotide which has been derivatized to contain a reactive amine group is 
specifically coupled to mRNAs having an intact cap. Preferably, the 3' end of the mRNA 
is blocked prior to the step in which the aldehyde groups are joined to the derivatized 
oligonucleotide, as described above, so as to prevent the derivatized oligonucleotide from 
being joined to the 3' end of the mRNA. For example, pCp may be attached to the 3' end 
of the mRNA using T4 RNA ligase. However, as discussed above, blocking the 3' end of 
the mRNA is an optional step. Derivatized oligonucleotides may be prepared as described 
below in Example 7. 

EXAMPLE 7 

Derivatization of the Oligonucleotide 

An oligonucleotide phosphorylated at its 3' end was converted to a 3' hydrazide 
by treatment with an aqueous solution of hydrazine or of dihydrazide of the formula 
H2N(R1)NH2 at about 1 to 3 M, and at pH 4.5, in the presence of a water-soluble 
carbodiimide-type agent such as 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide at a final 
concentration of 0.3 M at a temperature of 8°C overnight. 

The derivatized oligonucleotide was then separated from the other agents and 
products using a standard technique for isolating oligonucleotides. 

As discussed above, the mRNAs to be enriched may be treated to eliminate the 3' 
OH groups which may be present thereon. This may be accomplished by enzymatic 
ligation of sequences lacking a 3' OH, such as pCp, as described above in Example 1. 
Alternatively, the 3' OH groups may be eliminated by alkaline hydrolysis as described in 
Example 8 below. 

EXAMPLE 8 

Alkaline Hydrolysis of mRNA 
The mRNAs may be treated with alkaline hydrolysis as follows. In a total volume 
of 100 µl of 0.1 N sodium hydroxide, 1.5 µg mRNA is incubated for 40 to 60 minutes at 
4°C. The solution is neutralized with acetic acid and precipitated with ethanol. 

Following the optional elimination of the 3' OH groups, the diol groups at the 5' 
ends of the mRNAs are oxidized as described below in Example 9. 

EXAMPLE 9 
Oxidation of Diols 

Up to 1 OD unit of RNA was dissolved in 9 µl of buffer (0.1 M sodium acetate, 
pH 6-7 or water) and 3 µl of freshly prepared 0.1 M sodium periodate solution. The 
reaction was incubated for 1 h in the dark at 4°C or room temperature. Following the 
incubation, the reaction was stopped by adding 4 µl of 10% ethylene glycol. Thereafter 
the mixture was incubated at room temperature for 15 minutes. After ethanol 
precipitation, the product was resuspended in 10 µl or more of water or appropriate buffer 
and dialyzed against water. 

Following oxidation of the diol groups at the 5' ends of the mRNAs, the 
derivatized oligonucleotide was joined to the resulting aldehydes as described in 
Example 10. 

EXAMPLE 10 

Reaction of Aldehydes with Derivatized Oligonucleotides 
The oxidized mRNA was dissolved in an acidic medium such as 50 µl of sodium 
acetate pH 4-6. 50 µl of a solution of the derivatized oligonucleotide was added such that 
an mRNA:derivatized oligonucleotide ratio of 1:20 was obtained and the mixture was reduced 
with a borohydride. The mixture was allowed to incubate for 2 h at 37°C or overnight 
(14 h) at 10°C. The mixture was ethanol precipitated, resuspended in 10 µl or more of 
water or appropriate buffer and dialyzed against distilled water. If desired, the resulting 
product may be analyzed using acrylamide gel electrophoresis, HPLC analysis, or other 
conventional techniques. 
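
To make the 1:20 ratio concrete, the sketch below estimates how much derivatized oligonucleotide a given mass of mRNA would call for. Several inputs are assumptions that the text does not state: the ratio is taken to be molar, the mean mRNA length is set to a hypothetical 2000 nucleotides, and 340 g/mol is used as a rough average mass per ribonucleotide. The 7 µg input is the amount of placental mRNA used in Example 11.

```python
AVG_NT_MASS = 340.0  # g/mol per ribonucleotide, rough average (assumption)


def pmol_from_ug(mass_ug, length_nt, nt_mass=AVG_NT_MASS):
    """Convert a mass of RNA (in micrograms) into picomoles of molecules."""
    grams = mass_ug * 1e-6
    moles = grams / (length_nt * nt_mass)
    return moles * 1e12


def oligo_pmol_needed(mrna_ug, mean_length_nt, excess=20):
    """Picomoles of oligonucleotide for a 1:excess molar ratio (assumed molar)."""
    return excess * pmol_from_ug(mrna_ug, mean_length_nt)


mrna_pmol = pmol_from_ug(7.0, 2000)         # hypothetical mean length of 2000 nt
oligo_pmol = oligo_pmol_needed(7.0, 2000)

print(round(mrna_pmol, 1), round(oligo_pmol, 1))
```

With these assumed values, 7 µg of mRNA corresponds to roughly 10 pmol of molecules, so a twenty-fold excess would be on the order of 200 pmol of oligonucleotide. A different mean length changes the estimate proportionally.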

Following the attachment of the derivatized oligonucleotide to the mRNAs, a 
reverse transcription reaction may be performed as described in Example 11 below. 

EXAMPLE 11 
Reverse Transcription of mRNAs 
An oligodeoxyribonucleotide was derivatized as follows. 3 OD units of an 
oligodeoxyribonucleotide of sequence ATCAAGAATTCGCACGAGACCATTA (SEQ 
ID NO:3) having 5'-OH and 3'-P ends were dissolved in 70 µl of a 1.5 M 
hydroxybenzotriazole solution, pH 5.3, prepared in dimethylformamide/water (75:25) 
containing 2 µg of 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide. The mixture was 
incubated for 2 h 30 min at 22°C. The mixture was then precipitated twice in 
LiClO4/acetone. The pellet was resuspended in 200 µl of 0.25 M hydrazine and incubated 
at 8°C from 3 to 14 h. Following the hydrazine reaction, the mixture was precipitated 
twice in LiClO4/acetone. 

The messenger RNAs to be reverse transcribed were extracted from blocks of 
placenta having sides of 2 cm which had been stored at -80°C. The mRNA was extracted 
using conventional acidic phenol techniques. Oligo-dT chromatography was used to 
purify the mRNAs. The integrity of the mRNAs was checked by Northern-blotting. 

The diol groups on 7 µg of the placental mRNAs were oxidized as described 
above in Example 9. The derivatized oligonucleotide was joined to the mRNAs as 
described in Example 10 above except that the precipitation step was replaced by an 
exclusion chromatography step to remove derivatized oligodeoxyribonucleotides which 
were not joined to mRNAs. Exclusion chromatography was performed as follows: 

10 ml of AcA34 (BioSepra #230151) gel were equilibrated in 50 ml of a solution 
of 10 mM Tris pH 8.0, 300 mM NaCl, 1 mM EDTA, and 0.05% SDS. The mixture was 
allowed to sediment. The supernatant was eliminated and the gel was resuspended in 
50 ml of buffer. This procedure was repeated 2 or 3 times. 

A glass bead (diameter 3 mm) was introduced into a 2 ml disposable pipette 
(length 25 cm). The pipette was filled with the gel suspension until the height of the gel 
stabilized at 1 cm from the top of the pipette. The column was then equilibrated with 20 
ml of equilibration buffer (10 mM Tris HCl pH 7.4, 20 mM NaCl). 

10 µl of the mRNA which had been reacted with the derivatized oligonucleotide 
were mixed in 39 µl of 10 mM urea and 2 µl of blue-glycerol buffer, which had been 
prepared by dissolving 5 mg of bromophenol blue in 60% glycerol (v/v), and passing the 
mixture through a filter with a pore diameter of 0.45 µm. 

The column was loaded. As soon as the sample had penetrated, equilibration 
buffer was added. 100 µl fractions were collected. Derivatized oligonucleotide which had 
not been attached to mRNA appeared in fraction 16 and later fractions. Fractions 3 to 15 
were combined and precipitated with ethanol. 

The mRNAs which had been reacted with the derivatized oligonucleotide were 
spotted on a nylon membrane and hybridized to a radioactive probe using conventional 
techniques. The radioactive probe used in these hybridizations was an 
oligodeoxyribonucleotide of sequence TAATGGTCTCGTGCGAATTCTTGAT (SEQ ID 
NO:4) which was anticomplementary to the derivatized oligonucleotide and was labeled at 
its 5' end with ³²P. 1/10th of the mRNAs which had been reacted with the derivatized 
oligonucleotide was spotted in two spots on the membrane and the membrane was 
visualized by autoradiography after hybridization of the probe. A signal was observed, 
indicating that the derivatized oligonucleotide had been joined to the mRNA. 
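
The relationship between the probe and the derivatized oligonucleotide can be checked directly from the two sequences. The following sketch computes the reverse complement of SEQ ID NO:3 and compares it with SEQ ID NO:4. The function and variable names are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


TAG = "ATCAAGAATTCGCACGAGACCATTA"    # derivatized oligonucleotide, SEQ ID NO:3
PROBE = "TAATGGTCTCGTGCGAATTCTTGAT"  # hybridization probe, SEQ ID NO:4

print(reverse_complement(TAG) == PROBE)
print("GAATTC" in TAG)
```

Both lines print `True`: the probe is the exact reverse complement of the tag, and the tag contains GAATTC, the recognition sequence of EcoRI.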

The remaining 9/10 of the mRNAs which had been reacted with the derivatized 
oligonucleotide was reverse transcribed as follows. A reverse transcription reaction was 
carried out with reverse transcriptase following the manufacturer's instructions. To prime 
the reaction, 50 pmol of nonamers with random sequence were used. 

A portion of the resulting cDNA was spotted on a positively charged nylon 
membrane using conventional methods. The cDNAs were spotted on the membrane after 
the cDNA:RNA heteroduplexes had been subjected to an alkaline hydrolysis in order to 
eliminate the RNAs. An oligonucleotide having a sequence identical to that of the 
derivatized oligonucleotide was labeled at its 5' end with ³²P and hybridized to the cDNA 
blots using conventional techniques. Single-stranded cDNAs resulting from the reverse 
transcription reaction were spotted on the membrane. As controls, the blot contained 
1 pmol, 100 fmol, 50 fmol, 10 fmol and 1 fmol respectively of a control 
oligodeoxyribonucleotide of sequence identical to that of the derivatized oligonucleotide. 
The signal observed in the spots containing the cDNA indicated that approximately 15 
fmol of the derivatized oligonucleotide had been reverse transcribed. 
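
An estimate of this kind is obtained by comparing the signal of the cDNA spot with the signals of the control spots. The text does not describe how the comparison was made, so the sketch below shows one plausible approach only: interpolation between the two bracketing standards on a log-log scale. The control amounts are those listed above, while every signal value is hypothetical.

```python
import math

# Control amounts from the text, in fmol (1 pmol = 1000 fmol).
STANDARDS_FMOL = [1.0, 10.0, 50.0, 100.0, 1000.0]
# Hypothetical spot intensities in arbitrary units, for illustration only.
HYPOTHETICAL_SIGNAL = [12.0, 110.0, 520.0, 1000.0, 9500.0]


def estimate_fmol(signal, amounts=STANDARDS_FMOL, signals=HYPOTHETICAL_SIGNAL):
    """Interpolate in log-log space between the two bracketing standards."""
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the range covered by the standards")
    pairs = list(zip(amounts, signals))
    for (a0, s0), (a1, s1) in zip(pairs, pairs[1:]):
        if s0 <= signal <= s1:
            t = (math.log(signal) - math.log(s0)) / (math.log(s1) - math.log(s0))
            return math.exp(math.log(a0) + t * (math.log(a1) - math.log(a0)))


print(estimate_fmol(160.0))
```

A signal falling between the 10 fmol and 50 fmol standards yields an estimate between those two amounts, which is the kind of reading that gives a figure of approximately 15 fmol.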

These results demonstrate that the reverse transcription can be performed through 
the cap and, in particular, that reverse transcriptase crosses the 5'-P-P-P-5' bond of the cap 
of eukaryotic messenger RNAs. 

The single stranded cDNAs obtained after the above first strand synthesis were 
used as template for PCR reactions. Two types of reactions were carried out. First, 
specific amplification of the mRNAs for the alpha globin, dehydrogenase, pp15 and 
elongation factor E4 was carried out using the following pairs of 
oligodeoxyribonucleotide primers. 

alpha-globin 
GLO-S: CCG ACA AGA CCA ACG TCA AGG CCG C (SEQ ID NO:5) 
GLO-As: TCA CCA GCA GGC AGT GGC TTA GGA G (SEQ ID NO:6) 

dehydrogenase 
3 DH-S: AGT GAT TCC TGC TAC TTT GGA TGG C (SEQ ID NO:7) 
3 DH-As: GCT TGG TCT TGT TCT GGA GTT TAG A (SEQ ID NO:8) 

pp15 
PP15-S: TCC AGA ATG GGA GAC AAG CCA ATT T (SEQ ID NO:9) 
PP15-As: AGG GAG GAG GAA ACA GCG TGA GTC C (SEQ ID NO:10) 

Elongation factor E4 
EF1A-S: ATG GGA AAG GAA AAG ACT CAT ATC A (SEQ ID NO:11) 
EF1A-As: AGC AGC AAC AAT CAG GAC AGC ACA G (SEQ ID NO:12) 

Non specific amplifications were also carried out with the antisense (_As) 
oligodeoxyribonucleotides of the pairs described above and a primer 
chosen from the sequence of the derivatized oligodeoxyribonucleotide 
(ATCAAGAATTCGCACGAGACCATTA) (SEQ ID NO:13). 
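
The primers listed above can be tabulated to confirm their lengths and compare their base composition. The sketch below is illustrative only. The dictionary keys follow the primer labels as read from the text, and GC content is reported simply as the fraction of G and C bases.

```python
# Primer sequences as printed in the text (triplet spacing kept for readability).
PRIMERS = {
    "GLO-S": "CCG ACA AGA CCA ACG TCA AGG CCG C",
    "GLO-As": "TCA CCA GCA GGC AGT GGC TTA GGA G",
    "3 DH-S": "AGT GAT TCC TGC TAC TTT GGA TGG C",
    "3 DH-As": "GCT TGG TCT TGT TCT GGA GTT TAG A",
    "PP15-S": "TCC AGA ATG GGA GAC AAG CCA ATT T",
    "PP15-As": "AGG GAG GAG GAA ACA GCG TGA GTC C",
    "EF1A-S": "ATG GGA AAG GAA AAG ACT CAT ATC A",
    "EF1A-As": "AGC AGC AAC AAT CAG GAC AGC ACA G",
    "tag primer": "ATCAAGAATTCGCACGAGACCATTA",  # SEQ ID NO:13
}


def clean(seq):
    """Remove the spacing used for display."""
    return seq.replace(" ", "")


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    print(f"{name:10s} {len(clean(seq))} nt  GC = {gc_fraction(seq):.2f}")
```

All nine primers are 25 nucleotides long, and the tag primer of SEQ ID NO:13 has the same sequence as the derivatized oligonucleotide of SEQ ID NO:3.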

A 1.5% agarose gel containing the following samples corresponding to the PCR 
products of reverse transcription was stained with ethidium bromide (1/20th of the 
products of reverse transcription was used for each PCR reaction). 

Sample 1: The products of a PCR reaction using the globin primers of SEQ ID 
NOs 5 and 6 in the presence of cDNA. 

Sample 2: The products of a PCR reaction using the globin primers of SEQ ID 
NOs 5 and 6 in the absence of added cDNA. 

Sample 3: The products of a PCR reaction using the dehydrogenase primers of 
SEQ ID NOs 7 and 8 in the presence of cDNA. 

Sample 4: The products of a PCR reaction using the dehydrogenase primers of 
SEQ ID NOs 7 and 8 in the absence of added cDNA. 

Sample 5: The products of a PCR reaction using the pp15 primers of SEQ ID 
NOs 9 and 10 in the presence of cDNA. 

Sample 6: The products of a PCR reaction using the pp15 primers of SEQ ID NOs 
9 and 10 in the absence of added cDNA. 

Sample 7: The products of a PCR reaction using the EIE4 primers of SEQ ID NOs 
11 and 12 in the presence of added cDNA. 

Sample 8: The products of a PCR reaction using the EIE4 primers of SEQ ID NOs 
11 and 12 in the absence of added cDNA. 

In Samples 1, 3, 5 and 7, a band of the size expected for the PCR product was 
observed, indicating the presence of the corresponding sequence in the cDNA population. 

PCR reactions were also carried out with the antisense oligonucleotides of the 
globin and dehydrogenase primers (SEQ ID NOs 6 and 8) and an oligonucleotide whose 
sequence corresponds to that of the derivatized oligonucleotide. The presence of PCR 
products of the expected size in the samples corresponding to samples 1 and 3 above 
indicated that the derivatized oligonucleotide had been incorporated. 

The above examples summarize the chemical procedure for enriching mRNAs for 
those having intact 5' ends. Further detail regarding the chemical approaches for 
obtaining mRNAs having intact 5' ends is disclosed in International Application No. 
WO 96/34981, published November 7, 1996, which is incorporated herein by reference. 

Strategies based on the above chemical modifications to the 5' cap structure may 
be utilized to generate cDNAs which have been selected to include the 5' ends of the 
mRNAs from which they are derived. In one version of such procedures, the 5' ends of 
the mRNAs are modified as described above. Thereafter, a reverse transcription reaction 
is conducted to extend a primer complementary to the mRNA to the 5' end of the mRNA. 
Single stranded RNAs are eliminated to obtain a population of cDNA/mRNA 
heteroduplexes in which the mRNA includes an intact 5' end. The resulting 
heteroduplexes may be captured on a solid phase coated with a molecule capable of 
interacting with the molecule used to derivatize the 5' end of the mRNA. Thereafter, the 
strands of the heteroduplexes are separated to recover single stranded first cDNA strands 
which include the 5' end of the mRNA. Second strand cDNA synthesis may then proceed 
using conventional techniques. For example, the procedures disclosed in WO 96/34981 or 
in Carninci, P. et al. High-Efficiency Full-Length cDNA Cloning by Biotinylated CAP 
Trapper. Genomics 37:327-336 (1996), the disclosures of which are incorporated herein 
by reference, may be employed to select cDNAs which include the sequence derived from 
the 5' end of the coding sequence of the mRNA. 

Following ligation of the oligonucleotide tag to the 5' cap of the mRNA, a reverse 
transcription reaction is conducted to extend a primer complementary to the mRNA to the 
5' end of the mRNA. Following elimination of the RNA component of the resulting 
heteroduplex using standard techniques, second strand cDNA synthesis is conducted with 
a primer complementary to the oligonucleotide tag. 

Figure 1 summarizes the above procedures for obtaining cDNAs which have been 
selected to include the 5' ends of the mRNAs from which they are derived. 

B. Enzymatic Methods for Obtaining mRNAs having Intact 5' Ends 
Other techniques for selecting cDNAs extending to the 5' end of the mRNA from 
which they are derived are fully enzymatic. Some versions of these techniques are 
disclosed in Dumas Milne Edwards J.B. (Doctoral Thesis of Paris VI University, Le 
clonage des ADNc complets: difficultes et perspectives nouvelles. Apports pour l'etude 
de la regulation de l'expression de la tryptophane hydroxylase de rat, 20 Dec. 1993), EPO 
625572 and Kato et al. Construction of a Human Full-Length cDNA Bank. Gene 
150:243-250 (1994), the disclosures of which are incorporated herein by reference. 

Briefly, in such approaches, isolated mRNA is treated with alkaline phosphatase to 
remove the phosphate groups present on the 5' ends of uncapped incomplete mRNAs. 
Following this procedure, the cap present on full length mRNAs is enzymatically removed 
with a decapping enzyme such as T4 polynucleotide kinase or tobacco acid 
pyrophosphatase. An oligonucleotide, which may be either a DNA oligonucleotide or a 
DNA-RNA hybrid oligonucleotide having RNA at its 3' end, is then ligated to the 
phosphate present at the 5' end of the decapped mRNA using T4 RNA ligase. The 
oligonucleotide may include a restriction site to facilitate cloning of the cDNAs following 
their synthesis. Example 12 below describes one enzymatic method based on the doctoral 
thesis of Dumas. 
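
The reason the enzymatic approach tags only mRNAs that were originally capped can be followed step by step. The sketch below is a simplified model, not the protocol itself: the function `enzymatic_tagging` is a hypothetical name, and each mRNA is reduced to the state of its 5' end ("cap", "phosphate" or "hydroxyl").

```python
def enzymatic_tagging(five_prime_end):
    """Follow one mRNA 5' end through the three enzymatic steps.

    five_prime_end is "cap", "phosphate" or "hydroxyl".
    Returns True if the oligonucleotide tag ends up ligated to the mRNA.
    """
    # Step 1: alkaline phosphatase removes exposed 5' phosphates;
    # a capped end is left untouched.
    if five_prime_end == "phosphate":
        five_prime_end = "hydroxyl"
    # Step 2: the decapping enzyme removes the cap, leaving a 5' phosphate.
    if five_prime_end == "cap":
        five_prime_end = "phosphate"
    # Step 3: T4 RNA ligase joins the tag only to a 5' phosphate.
    return five_prime_end == "phosphate"


for start in ("cap", "phosphate", "hydroxyl"):
    print(start, enzymatic_tagging(start))
```

Only the end that started with a cap still carries a 5' phosphate when the ligase is added, so incomplete, uncapped mRNAs are excluded from tagging in this model.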

EXAMPLE 12 

Enzymatic Approach for Obtaining 5' ESTs 

Twenty micrograms of PolyA+ RNA were dephosphorylated using Calf Intestinal 
Phosphatase (Biolabs). After a phenol chloroform extraction, the cap structure of mRNA 
was hydrolysed using the Tobacco Acid Pyrophosphatase (purified as described by 
Shinshi et al., Biochemistry 15: 2185-2190, 1976) and a hemi 5'DNA/RNA-3' 
oligonucleotide having an unphosphorylated 5' end, a stretch of adenosine ribophosphate 
at the 3' end, and an EcoRI site near the 5' end was ligated to the 5' ends of mRNA using 
the T4 RNA ligase (Biolabs). Oligonucleotides suitable for use in this procedure are 
preferably 30-50 bases in length. Oligonucleotides having an unphosphorylated 5' end 
may be synthesized by adding a fluorochrome at the 5' end. The inclusion of a stretch of 
adenosine ribophosphates at the 3' end of the oligonucleotide increases ligation efficiency. 
It will be appreciated that the oligonucleotide may contain cloning sites other than EcoRI. 

Following ligation of the oligonucleotide to the phosphate present at the 5' end of 
the decapped mRNA, first and second strand cDNA synthesis may be carried out using 
conventional methods or those specified in EPO 625,572 and Kato et al. Construction of a 
Human Full-Length cDNA Bank. Gene 150:243-250 (1994), and Dumas Milne Edwards, 
supra, the disclosures of which are incorporated herein by reference. The resulting cDNA 
may then be ligated into vectors such as those disclosed in Kato et al. Construction of a 
Human Full-Length cDNA Bank. Gene 150:243-250 (1994) or other nucleic acid vectors 
known to those skilled in the art using techniques such as those described in Sambrook et 
al., Molecular Cloning: A Laboratory Manual 2d Ed., Cold Spring Harbor Laboratory 
Press, 1989, the disclosure of which is incorporated herein by reference. 

II. Characterization of 5' ESTs 

The above chemical and enzymatic approaches for enriching mRNAs having 
intact 5' ends were employed to obtain 5' ESTs. First, mRNAs were prepared as 
described in Example 13 below.

EXAMPLE 13

Preparation of mRNA 
Total human RNAs or PolyA+ RNAs derived from 29 different tissues were 
respectively purchased from LABIMO and CLONTECH and used to generate 44 cDNA 
libraries as described below. The purchased RNA had been isolated from cells or tissues
using acid guanidinium thiocyanate-phenol-chloroform extraction (Chomczynski, P. and
Sacchi, N., Analytical Biochemistry 162:156-159, 1987). PolyA+ RNA was isolated
from total RNA (LABIMO) by two passes of oligo dT chromatography, as described by
Aviv and Leder (Aviv, H. and Leder, P., Proc. Natl. Acad. Sci. USA 69:1408-1412,
1972) in order to eliminate ribosomal RNA.

The quality and the integrity of the PolyA+ RNAs were checked. Northern blots
hybridized with a globin probe were used to confirm that the mRNAs were not degraded.
Contamination of the PolyA+ mRNAs by ribosomal sequences was checked using RNA
blots and a probe derived from the sequence of the 28S RNA. Preparations of mRNAs
with less than 5% of ribosomal RNAs were used in library construction. To avoid
constructing libraries with RNAs contaminated by exogenous sequences (prokaryotic or
fungal), the presence of bacterial 16S ribosomal sequences or of two highly expressed
mRNAs was examined using PCR.

Following preparation of the mRNAs, the chemical and/or enzymatic procedures
discussed above for enriching mRNAs having intact 5' ends were
employed to obtain 5' ESTs from various tissues. In both approaches an oligonucleotide
tag was attached to the cap at the 5' ends of the mRNAs. The oligonucleotide tag had an
EcoRI site therein to facilitate later cloning procedures.

Following attachment of the oligonucleotide tag to the mRNA by either the
chemical or enzymatic methods, the integrity of the mRNA was examined by performing
a Northern blot with 200-500 ng of mRNA using a probe complementary to the
oligonucleotide tag.

EXAMPLE 14 

cDNA Synthesis Using mRNA Templates Having Intact 5' Ends 
For the mRNAs joined to oligonucleotide tags using both the chemical and
enzymatic methods, first strand cDNA synthesis was performed using reverse
transcriptase with random nonamers as primers. In order to protect internal EcoRI sites in
the cDNA from digestion at later steps in the procedure, methylated dCTP was used for
first strand synthesis. After removal of RNA by an alkaline hydrolysis, the first strand of
cDNA was precipitated using isopropanol in order to eliminate residual primers.

For both the chemical and the enzymatic methods, the second strand of the cDNA
was synthesized with a Klenow fragment using a primer corresponding to the 5' end of the
ligated oligonucleotide described in Example 12. Preferably, the primer is 20-25 bases in
length. Methylated dCTP was also used for second strand synthesis in order to protect
internal EcoRI sites in the cDNA from digestion during the cloning process.

Following cDNA synthesis, the cDNAs were cloned into pBlueScript as described 
in Example 15 below. 

EXAMPLE 15 

Insertion of cDNAs into BlueScript

Following second strand synthesis, the ends of the cDNA were blunted with T4
DNA polymerase (Biolabs) and the cDNA was digested with EcoRI. Since methylated
dCTP was used during cDNA synthesis, the EcoRI site present in the tag was the only site
which was hemi-methylated. Consequently, only the EcoRI site in the oligonucleotide tag
was susceptible to EcoRI digestion. The cDNA was then size fractionated using exclusion
chromatography (AcA, Biosepra). Fractions corresponding to cDNAs of more than 150 bp
were pooled and ethanol precipitated. The cDNA was directionally cloned into the SmaI
and EcoRI ends of the phagemid pBlueScript vector (Stratagene). The ligation mixture
was electroporated into bacteria and propagated under appropriate antibiotic selection.

Clones containing the oligonucleotide tag attached were selected as described in
Example 16 below.

EXAMPLE 16 

Selection of Clones Having the Oligonucleotide Tag Attached Thereto 
The plasmid DNAs containing 5' EST libraries made as described above were
purified (Qiagen). A positive selection of the tagged clones was performed as follows.
Briefly, in this selection procedure, the plasmid DNA was converted to single stranded
DNA using gene II endonuclease of the phage F1 in combination with an exonuclease
(Chang et al., Gene 127:95-8, 1993) such as exonuclease III or T7 gene 6 exonuclease.
The resulting single stranded DNA was then purified using paramagnetic beads as
described by Fry et al., Biotechniques, 13: 124-131, 1992. In this procedure, the single
stranded DNA was hybridized with a biotinylated oligonucleotide having a sequence
corresponding to the 3' end of the oligonucleotide described in Example 13. Preferably,
the primer has a length of 20-25 bases. Clones including a sequence complementary to
the biotinylated oligonucleotide were captured by incubation with streptavidin coated
magnetic beads followed by magnetic selection. After capture of the positive clones, the
plasmid DNA was released from the magnetic beads and converted into double stranded
DNA using a DNA polymerase such as the ThermoSequenase obtained from Amersham
Pharmacia Biotech. Alternatively, protocols such as the Gene Trapper kit (Gibco BRL)
may be used. The double stranded DNA was then electroporated into bacteria. The
percentage of positive clones having the 5' tag oligonucleotide was estimated to typically
range between 90 and 98% using dot blot analysis.

Following electroporation, the libraries were ordered in 384-well microtiter plates
(MTP). A copy of the MTP was stored for future needs. Then the libraries were
transferred into 96-well MTP and sequenced as described below.

EXAMPLE 17 
Sequencing of Inserts in Selected Clones 
Plasmid inserts were first amplified by PCR on PE 9600 thermocyclers (Perkin-Elmer),
using standard SETA-A and SETA-B primers (Genset SA), AmpliTaq Gold
(Perkin-Elmer), dNTPs (Boehringer), buffer and cycling conditions as recommended by
the Perkin-Elmer Corporation.

PCR products were then sequenced using automatic ABI Prism 377 sequencers
(Perkin Elmer, Applied Biosystems Division, Foster City, CA). Sequencing reactions
were performed using PE 9600 thermocyclers (Perkin Elmer) with standard dye-primer
chemistry and ThermoSequenase (Amersham Life Science). The primers used were either
T7 or 21M13 (available from Genset SA) as appropriate. The primers were labeled with
the JOE, FAM, ROX and TAMRA dyes. The dNTPs and ddNTPs used in the sequencing
reactions were purchased from Boehringer. Sequencing buffer, reagent concentrations and
cycling conditions were as recommended by Amersham.

Following the sequencing reaction, the samples were precipitated with EtOH,
resuspended in formamide loading buffer, and loaded on a standard 4% acrylamide gel.
Electrophoresis was performed for 2.5 hours at 3000V on an ABI 377 sequencer, and the
sequence data were collected and analyzed using the ABI Prism DNA Sequencing
Analysis Software, version 2.1.2.

The sequence data from the 44 cDNA libraries made as described above were
transferred to a proprietary database, where quality control and validation steps were
performed. A proprietary base-caller ("Trace"), running on a Unix system,
automatically flagged suspect peaks, taking into account the shape of the peaks, the
inter-peak resolution, and the noise level. The proprietary base-caller also performed an
automatic trimming. Any stretch of 25 or fewer bases having more than 4 suspect peaks
was considered unreliable and was discarded. Sequences corresponding to cloning vector
or ligation oligonucleotides were automatically removed from the EST sequences.
However, the resulting EST sequences may contain 1 to 5 bases belonging to the above
mentioned sequences at their 5' end. If needed, these can easily be removed on a case by
case basis.
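
By way of illustration only, the trimming rule stated above (a stretch of 25 or fewer bases
having more than 4 suspect peaks is unreliable) may be sketched as follows. This is not the
proprietary base-caller; the function name, the per-base boolean input, and the choice to
mark only the span between the first and last suspect peak of each offending stretch are
assumptions made for the example.

```python
def unreliable_mask(suspect, window=25, max_suspect=4):
    """Mark bases lying in a stretch of at most `window` bases that
    contains more than `max_suspect` suspect peaks.

    `suspect` is a hypothetical per-base list of booleans (True = suspect
    peak). Returns a list of booleans of the same length.
    """
    positions = [i for i, flag in enumerate(suspect) if flag]
    mask = [False] * len(suspect)
    k = max_suspect + 1  # smallest number of suspect peaks that triggers discarding
    for j in range(len(positions) - k + 1):
        first, last = positions[j], positions[j + k - 1]
        # k suspect peaks packed into a stretch of `window` bases or fewer
        if last - first + 1 <= window:
            for i in range(first, last + 1):
                mask[i] = True
    return mask
```

Bases for which the mask is True would be discarded. Five suspect peaks spread over more
than 25 bases leave the mask empty, since no single stretch exceeds the limit.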

Thereafter, the sequences were transferred to the proprietary NETGENE™
Database for further analysis as described below.

Following sequencing as described above, the sequences of the 5' ESTs were
entered in a proprietary database called NETGENE™ for storage and manipulation. It
will be appreciated by those skilled in the art that the data could be stored and manipulated
on any medium which can be read and accessed by a computer. Computer readable media
include magnetically readable media, optically readable media, or electronically readable
media. For example, the computer readable media may be a hard disc, a floppy disc, a
magnetic tape, CD-ROM, RAM, or ROM as well as other types of media known to
those skilled in the art.

In addition, the sequence data may be stored and manipulated in a variety of data 
processor programs in a variety of formats. For example, the sequence data may be stored 
as text in a word processing file, such as Microsoft WORD or WORDPERFECT or as an 
ASCII file in a variety of database programs familiar to those of skill in the art, such as 
DB2, SYBASE, or ORACLE. 

The computer readable media on which the sequence information is stored may be 
in a personal computer, a network, a server or other computer systems known to those 
skilled in the art. The computer or other system preferably includes the storage media 
described above, and a processor for accessing and manipulating the sequence data. 

Once the sequence data has been stored it may be manipulated and searched to
locate those stored sequences which contain a desired nucleic acid sequence or which
encode a protein having a particular functional domain. For example, the stored sequence
information may be compared to other known sequences to identify homologies, motifs
implicated in biological function, or structural motifs.

Programs which may be used to search or compare the stored sequences include
the MacPattern (EMBL), BLAST, and BLAST2 program series (NCBI), basic local
alignment search tool programs for nucleotide (BLASTN) and peptide (BLASTX)
comparisons (Altschul et al., J. Mol. Biol. 215: 403 (1990)) and FASTA (Pearson and
Lipman, Proc. Natl. Acad. Sci. USA, 85: 2444 (1988)). The BLAST programs then
extend the alignments on the basis of defined match and mismatch criteria.

Motifs which may be detected using the above programs include sequences 
encoding leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites,
alpha helices, and beta sheets, signal sequences encoding signal peptides which direct the 
secretion of the encoded proteins, sequences implicated in transcription regulation such as 
homeoboxes, acidic stretches, enzymatic active sites, substrate binding sites, and 
enzymatic cleavage sites. 

Before searching the cDNAs in the NETGENE™ database for sequence motifs of
interest, cDNAs derived from mRNAs which were not of interest were identified and
eliminated from further consideration as described in Example 18 below.

EXAMPLE 18 

Elimination of Undesired Sequences from Further Consideration

5' ESTs in the NETGENE™ database which were derived from undesired
sequences such as transfer RNAs, ribosomal RNAs, mitochondrial RNAs, procaryotic
RNAs, fungal RNAs, Alu sequences, L1 sequences, or repeat sequences were identified
using the FASTA and BLASTN programs with the parameters listed in Table II.

To eliminate 5' ESTs encoding tRNAs from further consideration, the 5' EST
sequences were compared to the sequences of 1190 known tRNAs obtained from EMBL
release 38, of which 100 were human. The comparison was performed using FASTA on
both strands of the 5' ESTs. Sequences having more than 80% homology over more than
60 nucleotides were identified as tRNA. Of the 144,341 sequences screened, 26 were
identified as tRNAs and eliminated from further consideration.

To eliminate 5' ESTs encoding rRNAs from further consideration, the 5' EST
sequences were compared to the sequences of 2497 known rRNAs obtained from EMBL
release 38, of which 73 were human. The comparison was performed using BLASTN on
both strands of the 5' ESTs with the parameter S=108. Sequences having more than 80%
homology over stretches longer than 40 nucleotides were identified as rRNAs. Of the
144,341 sequences screened, 3,312 were identified as rRNAs and eliminated from further
consideration.

To eliminate 5' ESTs encoding mtRNAs from further consideration, the 5' EST
sequences were compared to the sequences of the two known mitochondrial genomes for
which the entire genomic sequences are available and all sequences transcribed from these
mitochondrial genomes including tRNAs, rRNAs, and mRNAs for a total of 38
sequences. The comparison was performed using BLASTN on both strands of the 5'
ESTs with the parameter S=108. Sequences having more than 80% homology over
stretches longer than 40 nucleotides were identified as mtRNAs. Of the 144,341
sequences screened, 6,110 were identified as mtRNAs and eliminated from further
consideration.

Sequences which might have resulted from exogenous contaminants were
eliminated from further consideration by comparing the 5' EST sequences to release 46 of
the EMBL bacterial and fungal divisions using BLASTN with the parameter S=144. All
sequences having more than 90% homology over at least 40 nucleotides were identified as
exogenous contaminants. Of the 42 cDNA libraries examined, the average percentages of
procaryotic and fungal sequences contained therein were 0.2% and 0.5% respectively.
Among these sequences, only one could be identified as a sequence specific to fungi. The
others were either fungal or procaryotic sequences having homologies with vertebrate
sequences or including repeat sequences which had not been masked during the electronic
comparison.
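
By way of illustration only, the homology and length thresholds recited in this Example for
tRNA, rRNA, mtRNA, and exogenous sequences may be collected into a single decision
rule as sketched below. The sketch assumes that each FASTA or BLASTN hit has already
been reduced to a percent identity and an aligned length by an external step; the names
Rule, RULES, and is_undesired are hypothetical and "homology" is read here as percent
identity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    min_identity: float  # percent identity that a hit must exceed
    min_length: int      # aligned length threshold, in nucleotides
    inclusive: bool      # True: "at least" min_length; False: "more than"


# Thresholds as recited in the text of this Example.
RULES = {
    "tRNA": Rule(80.0, 60, False),
    "rRNA": Rule(80.0, 40, False),
    "mtRNA": Rule(80.0, 40, False),
    "exogenous": Rule(90.0, 40, True),
}


def is_undesired(category, identity, length):
    """Return True if a hit against `category` meets the stated thresholds."""
    rule = RULES[category]
    if rule.inclusive:
        long_enough = length >= rule.min_length
    else:
        long_enough = length > rule.min_length
    return identity > rule.min_identity and long_enough
```

A 5' EST would be eliminated when is_undesired returns True for any of its hits.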

In addition, the 5' ESTs were compared to 6093 Alu sequences and 1115 L1
sequences to mask 5' ESTs containing such repeat sequences from further consideration.
5' ESTs including THE and MER repeats, SSTR sequences or satellite, micro-satellite, or
telomeric repeats were also eliminated from further consideration. On average, 11.5% of
the sequences in the libraries contained repeat sequences. Of this 11.5%, 7% contained
Alu repeats, 3.3% contained L1 repeats and the remaining 1.2% were derived from the
other types of repetitive sequences which were screened. These percentages are consistent
with those found in cDNA libraries prepared by other groups. For example, the cDNA
libraries of Adams et al. contained between 0% and 7.4% Alu repeats depending on the
source of the RNA which was used to prepare the cDNA library (Adams et al., Nature
377:174, 1996).

The sequences of those 5' ESTs remaining after the elimination of undesirable
sequences were compared with the sequences of known human mRNAs to determine the
accuracy of the sequencing procedures described above.

EXAMPLE 19 

Measurement of Sequencing Accuracy by Comparison to Known Sequences
To further determine the accuracy of the sequencing procedure described above,
the sequences of 5' ESTs derived from known sequences were identified and compared to 
the known sequences. First, a FASTA analysis with overhangs shorter than 5 bp on both 
ends was conducted on the 5' ESTs to identify those matching an entry in the public 
human mRNA database. The 6655 5' ESTs which matched a known human mRNA were 
then realigned with their cognate mRNA and dynamic programming was used to include 
substitutions, insertions, and deletions in the list of "errors" which would be recognized. 
Errors occurring in the last 10 bases of the 5' EST sequences were ignored to avoid the 
inclusion of spurious cloning sites in the analysis of sequencing accuracy. 

This analysis revealed that the sequences incorporated in the NETGENE™ 
database had an accuracy of more than 99.5%. 
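
By way of illustration only, the counting of substitutions, insertions, and deletions by
dynamic programming, with the last 10 bases of each 5' EST ignored, may be sketched as
follows. The sketch assumes that each reference string begins at the position where the
5' EST alignment begins and allows the reference to run past the end of the EST at no
cost; the function names and this alignment convention are assumptions rather than the
procedure actually used.

```python
def count_errors(est, reference, ignore_last=10):
    """Return (errors, bases_compared) for one EST against its cognate mRNA.

    Errors are substitutions, insertions, and deletions found by an
    edit-distance recurrence; the last `ignore_last` EST bases are skipped.
    """
    query = est[:-ignore_last] if ignore_last else est
    prev = list(range(len(reference) + 1))  # row for an empty query
    for i, q in enumerate(query, start=1):
        curr = [i]
        for j, r in enumerate(reference, start=1):
            curr.append(min(prev[j] + 1,               # extra base in the EST
                            curr[j - 1] + 1,           # base missing from the EST
                            prev[j - 1] + (q != r)))   # match or substitution
        prev = curr
    # The reference may extend beyond the EST, so take the best end column.
    return min(prev), len(query)


def accuracy(pairs, ignore_last=10):
    """Pooled accuracy (0 to 1) over (est, reference) pairs."""
    errors = total = 0
    for est, ref in pairs:
        e, n = count_errors(est, ref, ignore_last)
        errors += e
        total += n
    return 1.0 - errors / total if total else float("nan")
```

Pooling errors over all compared bases, rather than averaging per-EST rates, is a design
choice of the sketch and weights long ESTs more heavily.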

To determine the efficiency with which the above selection procedures select 
cDNAs which include the 5' ends of their corresponding mRNAs, the following analysis 
was performed. 

EXAMPLE 20 

Determination of Efficiency of 5' EST Selection
To determine the efficiency at which the above selection procedures isolated 5'
ESTs which included sequences close to the 5' end of the mRNAs from which they were
derived, the sequences of the ends of the 5' ESTs which were derived from the elongation
factor 1 subunit α and ferritin heavy chain genes were compared to the known cDNA
sequences for these genes. Since the transcription start sites for the elongation factor 1
subunit α and ferritin heavy chain are well characterized, they may be used to determine
the percentage of 5' ESTs derived from these genes which included the authentic
transcription start sites.

For both genes, more than 95% of the cDNAs included sequences close to or 
upstream of the 5' end of the corresponding mRNAs. 

To extend the analysis of the reliability of the procedures for isolating 5' ESTs
from ESTs in the NETGENE™ database, a similar analysis was conducted using a
database composed of human mRNA sequences extracted from GenBank database release
97 for comparison. For those 5' ESTs derived from mRNAs included in the GenBank
database, more than 85% had their 5' ends close to the 5' ends of the known sequence. As
some of the mRNA sequences available in the GenBank database are deduced from
genomic sequences, a 5' end matching with these sequences will be counted as an internal
match. Thus, the method used here underestimates the yield of ESTs including the
authentic 5' ends of their corresponding mRNAs.

The EST libraries made above included multiple 5' ESTs derived from the same
mRNA. The sequences of such 5' ESTs were compared to one another and the longest 5'
ESTs for each mRNA were identified. Overlapping cDNAs were assembled into
continuous sequences (contigs). The resulting continuous sequences were then compared
to public databases to gauge their similarity to known sequences, as described in Example
21 below.

EXAMPLE 21 

Clustering of the 5' ESTs and Calculation of Novelty Indices for cDNA Libraries 
For each sequenced EST library, the sequences were clustered by the 5' end. Each
sequence in the library was compared to the others with BLASTN2 (direct strand,
parameters S=107). ESTs with High Scoring Segment Pairs (HSPs) at least 25 bp long,
having 95% identical bases and beginning closer than 10 bp from each EST 5' end were
grouped. The longest sequence found in the cluster was used as representative of the
cluster. A global clustering between libraries was then performed leading to the definition
of super-contigs.
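
By way of illustration only, the grouping criteria above (HSPs at least 25 bp long, 95%
identical bases, beginning closer than 10 bp from each EST 5' end) and the choice of the
longest sequence as cluster representative may be sketched with a union-find structure.
The HSP tuples are assumed to come from an external BLASTN2 run; the tuple layout,
the 0-based start offsets, the reading of "95%" as "at least 95%", and the function name
are assumptions made for the example.

```python
from collections import defaultdict


def cluster_ests(lengths, hsps, min_len=25, min_identity=95.0, max_offset=10):
    """Group ESTs linked by qualifying HSPs.

    lengths: dict mapping EST name to sequence length.
    hsps: iterable of (est_a, est_b, hsp_length, percent_identity,
          start_in_a, start_in_b), with starts counted from each 5' end.
    Returns a dict mapping each cluster's longest EST to its sorted members.
    """
    parent = {name: name for name in lengths}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b, hsp_len, identity, start_a, start_b in hsps:
        if (hsp_len >= min_len and identity >= min_identity
                and start_a < max_offset and start_b < max_offset):
            parent[find(a)] = find(b)

    clusters = defaultdict(list)
    for name in lengths:
        clusters[find(name)].append(name)
    return {max(members, key=lengths.get): sorted(members)
            for members in clusters.values()}
```

The same routine could be applied a second time to the cluster representatives of several
libraries to approximate the global clustering step.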

To assess the yield of new sequences within the EST libraries, a novelty rate (NR)
was defined as: NR = 100 × (Number of new unique sequences found in the library/Total
number of sequences from the library). Typically, novelty rates range between 10% and
41% depending on the tissue from which the EST library was obtained. For most of the
libraries, the random sequencing of 5' EST libraries was pursued until the novelty rate
reached 20%.
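
By way of illustration only, the novelty rate defined above may be computed as follows.
The function name and the counts in the usage note are hypothetical and are not data
from the libraries described herein.

```python
def novelty_rate(new_unique, total):
    """NR = 100 x (new unique sequences in the library / total sequences)."""
    if total <= 0:
        raise ValueError("total number of sequences must be positive")
    return 100.0 * new_unique / total
```

For a hypothetical library of 200 sequences of which 82 are new and unique, the function
returns 41.0.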

Following characterization as described above, the collection of 5' ESTs in
NETGENE™ was screened to identify those 5' ESTs bearing potential signal sequences 
as described in Example 22 below. 

EXAMPLE 22 

Identification of Potential Signal Sequences in 5' ESTs

The 5' ESTs in the NETGENE™ database were screened to identify those having
an uninterrupted open reading frame (ORF) longer than 45 nucleotides beginning with an
ATG codon and extending to the end of the EST. Approximately half of the cDNA
sequences in NETGENE™ contained such an ORF. The ORFs of these 5' ESTs were
searched to identify potential signal motifs using slight modifications of the procedures
disclosed in Von Heijne, G. A New Method for Predicting Signal Sequence Cleavage
Sites. Nucleic Acids Res. 14:4683-4690 (1986), the disclosure of which is incorporated
herein by reference. Those 5' EST sequences encoding a 15 amino acid long stretch with
a score of at least 3.5 in the Von Heijne signal peptide identification matrix were
considered to possess a signal sequence. Those 5' ESTs which matched a known human
mRNA or EST sequence and had a 5' end more than 20 nucleotides downstream of the
known 5' end were excluded from further analysis. The remaining cDNAs having signal
sequences therein were included in a database called SIGNALTAG™.
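
By way of illustration only, the first screening step above (an uninterrupted ORF longer
than 45 nucleotides, beginning with an ATG codon and extending to the end of the EST)
may be sketched as follows. The sketch measures ORF length in complete codons from
the ATG to the end of the EST and uses the standard stop codons; the function name and
these conventions are assumptions. Scoring with the Von Heijne matrix is not shown,
since the matrix values are not reproduced herein.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_open_orfs(est, min_length=45):
    """Return 0-based positions of ATG codons whose reading frame runs to
    the end of the EST without a stop codon and spans more than
    `min_length` nucleotides of complete codons."""
    seq = est.upper()
    starts = []
    for start in range(len(seq) - 2):
        if seq[start:start + 3] != "ATG":
            continue
        # Complete codons from the ATG to the end of the EST.
        codons = [seq[i:i + 3] for i in range(start, len(seq) - 2, 3)]
        if any(codon in STOP_CODONS for codon in codons):
            continue  # frame is interrupted before the end of the EST
        if 3 * len(codons) > min_length:
            starts.append(start)
    return starts
```

An EST for which the function returns an empty list would not be passed on to the signal
peptide scoring step.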

To confirm the accuracy of the above method for identifying signal sequences, the
analysis of Example 23 was performed.

EXAMPLE 23 

Confirmation of Accuracy of Identification of Potential Signal Sequences in 5' ESTs
The accuracy of the above procedure for identifying signal sequences encoding 
signal peptides was evaluated by applying the method to the 43 amino terminal amino 
acids of all human SwissProt proteins. The computed Von Heijne score for each protein
was compared with the known characterization of the protein as being a secreted protein 
or a non-secreted protein. In this manner, the number of non-secreted proteins having a 
score higher than 3.5 (false positives) and the number of secreted proteins having a score 
lower than 3.5 (false negatives) could be calculated. 

Using the results of the above analysis, the probability that a peptide encoded by
the 5' region of the mRNA is in fact a genuine signal peptide based on its Von Heijne's
score was calculated based on either the assumption that 10% of human proteins are
secreted or the assumption that 20% of human proteins are secreted. The results of this
analysis are shown in Figures 2 and 3.
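
By way of illustration only, one way such a probability could be derived from the false
positive and false negative counts is Bayes' rule, as sketched below. The specification
does not state the formula used, so the use of Bayes' rule, the function name, and the
rates shown in the usage note are assumptions and not results of the analysis.

```python
def posterior_signal_peptide(prior, true_positive_rate, false_positive_rate):
    """Probability that a peptide scoring at or above a threshold is a
    genuine signal peptide.

    prior: assumed fraction of human proteins that are secreted (e.g. 0.10).
    true_positive_rate: fraction of secreted proteins at or above the threshold.
    false_positive_rate: fraction of non-secreted proteins at or above it.
    """
    numerator = prior * true_positive_rate
    denominator = numerator + (1.0 - prior) * false_positive_rate
    return numerator / denominator if denominator else float("nan")
```

With hypothetical rates of 0.9 and 0.1, the posterior is about 0.50 under the 10% assumption
and about 0.69 under the 20% assumption, which shows how strongly the result depends
on the assumed proportion of secreted proteins.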

Using the above method of identifying secretory proteins, 5' ESTs for human
glucagon, gamma interferon induced monokine precursor, secreted cyclophilin-like
protein, human pleiotropin, and human biotinidase precursor, all of which are polypeptides
which are known to be secreted, were obtained. Thus, the above method successfully
identified those 5' ESTs which encode a signal peptide.

To confirm that the signal peptide encoded by the 5' ESTs actually functions as a
signal peptide, the signal sequences from the 5' ESTs may be cloned into a vector
designed for the identification of signal peptides. Some signal peptide identification
vectors are designed to confer the ability to grow in selective medium on host cells which
have a signal sequence operably inserted into the vector. For example, to confirm that a 5'
EST encodes a genuine signal peptide, the signal sequence of the 5' EST may be inserted
upstream and in frame with a non-secreted form of the yeast invertase gene in signal
peptide selection vectors such as those described in U.S. Patent No. 5,536,637, the
disclosure of which is incorporated herein by reference. Growth of host cells containing
signal sequence selection vectors having the signal sequence from the 5' EST inserted
therein confirms that the 5' EST encodes a genuine signal peptide.

Alternatively, the presence of a signal peptide may be confirmed by cloning the
extended cDNAs obtained using the ESTs into expression vectors such as pXT1 (as
described below), or by constructing promoter-signal sequence-reporter gene vectors
which encode fusion proteins between the signal peptide and an assayable reporter protein.
After introduction of these vectors into a suitable host cell, such as COS cells or NIH 3T3
cells, the growth medium may be harvested and analyzed for the presence of the secreted
protein. The medium from these cells is compared to the medium from cells containing
vectors lacking the signal sequence or extended cDNA insert to identify vectors which
encode a functional signal peptide or an authentic secreted protein.

Those 5' ESTs which encoded a signal peptide, as determined by the method of
Example 22 above, were further grouped into four categories based on their homology to 
known sequences. The categorization of the 5' ESTs is described in Example 24 below. 

EXAMPLE 24 
Categorization of 5' ESTs Encoding a Signal Peptide 
Those 5' ESTs having a sequence not matching any known vertebrate sequence 
nor any publicly available EST sequence were designated "new." Of the sequences in the 
SIGNALTAG™ database, 947 of the 5' ESTs having a Von Heijne's score of at least 3.5 
fell into this category. 

Those 5' ESTs having a sequence not matching any vertebrate sequence but 
matching a publicly known EST were designated "EST-ext", provided that the known 
EST sequence was extended by at least 40 nucleotides in the 5' direction. Of the 
sequences in the SIGNALTAG™ database, 150 of the 5' ESTs having a Von Heijne's 
score of at least 3.5 fell into this category. 

Those ESTs not matching any vertebrate sequence but matching a publicly known 
EST without extending the known EST by at least 40 nucleotides in the 5' direction were 
designated "EST." Of the sequences in the SIGNALTAG™ database, 599 of the 5' ESTs 
having a Von Heijne's score of at least 3.5 fell into this category. 

Those 5' ESTs matching a human mRNA sequence but extending the known 
sequence by at least 40 nucleotides in the 5' direction were designated "VERT-ext." Of 
the sequences in the SIGNALTAG™ database, 23 of the 5' ESTs having a Von Heijne's
score of at least 3.5 fell into this category. Included in this category was a 5' EST which 
extended the known sequence of the human translocase mRNA by more than 200 bases in 
the 5' direction. A 5' EST which extended the sequence of a human tumor suppressor 
gene in the 5' direction was also identified. 

Figure 4 shows the distribution of 5' ESTs in each category and the number of 5' 
ESTs in each category having a given minimum von Heijne's score. 
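
By way of illustration only, the four categories defined in this Example may be expressed
as a single decision function. The boolean inputs and the extension length are assumed
to have been derived beforehand from database comparisons; the function and parameter
names are hypothetical, and 5' ESTs matching a known mRNA without extending it by at
least 40 nucleotides are returned as None because this Example assigns them no category.

```python
def categorize(matches_vertebrate, matches_public_est, extension_nt,
               min_extension=40):
    """Assign a 5' EST to "new", "EST-ext", "EST", or "VERT-ext".

    matches_vertebrate: True if the EST matches a known vertebrate
        (here, human mRNA) sequence.
    matches_public_est: True if the EST matches a publicly known EST.
    extension_nt: nucleotides by which the 5' EST extends the matched
        known sequence in the 5' direction (0 if there is no match).
    """
    if matches_vertebrate:
        return "VERT-ext" if extension_nt >= min_extension else None
    if matches_public_est:
        return "EST-ext" if extension_nt >= min_extension else "EST"
    return "new"
```

Tallying the returned labels over all 5' ESTs at or above a chosen Von Heijne score would
give a distribution of the kind presented in Figure 4.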

Each of the 5' ESTs was categorized based on the tissue from which its 
corresponding mRNA was obtained, as described below in Example 25. 

EXAMPLE 25 
Categorization of Expression Patterns
Figure 5 shows the tissues from which the mRNAs corresponding to the 5' ESTs
in each of the above described categories were obtained.

In addition to categorizing the 5' ESTs by the tissue from which the cDNA library
in which they were first identified was obtained, the spatial and temporal expression
patterns of the mRNAs corresponding to the 5' ESTs, as well as their expression levels,
may be determined as described in Example 26 below. Characterization of the spatial and
temporal expression patterns and expression levels of these mRNAs is useful for
constructing expression vectors capable of producing a desired level of gene product in a
desired spatial or temporal manner, as will be discussed in more detail below.

In addition, 5' ESTs whose corresponding mRNAs are associated with disease
states may also be identified. For example, a particular disease may result from lack of
expression, over expression, or under expression of an mRNA corresponding to a 5' EST.
By comparing mRNA expression patterns and quantities in samples taken from healthy
individuals with those from individuals suffering from a particular disease, 5' ESTs
responsible for the disease may be identified.

It will be appreciated that the results of the above characterization procedures for
5' ESTs also apply to extended cDNAs (obtainable as described below) which contain
sequences adjacent to the 5' ESTs. It will also be appreciated that if it is desired to defer
characterization until extended cDNAs have been obtained rather than characterizing the
ESTs themselves, the above characterization procedures can be applied to characterize the
extended cDNAs after their isolation.

EXAMPLE 26 

Evaluation of Expression Levels and Patterns of mRNAs
Corresponding to 5' ESTs or Extended cDNAs
Expression levels and patterns of mRNAs corresponding to 5' ESTs or extended
cDNAs (obtainable as described below) may be analyzed by solution hybridization with
long probes as described in International Patent Application No. WO 97/05277, the entire
contents of which are hereby incorporated by reference. Briefly, a 5' EST, extended
cDNA, or fragment thereof corresponding to the gene encoding the mRNA to be
characterized is inserted at a cloning site immediately downstream of a bacteriophage (T3,
T7 or SP6) RNA polymerase promoter to produce antisense RNA. Preferably, the 5' EST
or extended cDNA has 100 or more nucleotides. The plasmid is linearized and transcribed
in the presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP
and DIG-UTP). An excess of this doubly labeled RNA is hybridized in solution with
mRNA isolated from cells or tissues of interest. The hybridizations are performed under
standard stringent conditions (40-50°C for 16 hours in an 80% formamide, 0.4 M NaCl
buffer, pH 7-8). The unhybridized probe is removed by digestion with ribonucleases
specific for single-stranded RNA (i.e. RNases CL3, T1, Phy M, U2 or A). The presence
of the biotin-UTP modification enables capture of the hybrid on a microtitration plate
coated with streptavidin. The presence of the DIG modification enables the hybrid to be
detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline
phosphatase.

The 5' ESTs, extended cDNAs, or fragments thereof may also be tagged with
nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK
Patent Application No. 2 305 241 A, the entire contents of which are incorporated by
reference. In this method, cDNAs are prepared from a cell, tissue, organism or other
source of nucleic acid for which it is desired to determine gene expression patterns. The
resulting cDNAs are separated into two pools. The cDNAs in each pool are cleaved with
a first restriction endonuclease, called an "anchoring enzyme," having a recognition site
which is likely to be present at least once in most cDNAs. The fragments which contain
the 5' or 3' most region of the cleaved cDNA are isolated by binding to a capture medium
such as streptavidin coated beads. A first oligonucleotide linker having a first sequence
for hybridization of an amplification primer and an internal restriction site for a "tagging
endonuclease" is ligated to the digested cDNAs in the first pool. Digestion with the
second endonuclease produces short "tag" fragments from the cDNAs.

A second oligonucleotide having a second sequence for hybridization of an 
amplification primer and an internal restriction site is ligated to the digested cDNAs in the 
second pool. The cDNA fragments in the second pool are also digested with the "tagging 
endonuclease" to generate short "tag" fragments derived from the cDNAs in the second 
pool. The "tags" resulting from digestion of the first and second pools with the anchoring
enzyme and the tagging endonuclease are ligated to one another to produce "ditags." In
some embodiments, the ditags are concatamerized to produce ligation products containing
from 2 to 200 ditags. The tag sequences are then determined and compared to the 
sequences of the 5' ESTs or extended cDNAs to determine which 5' ESTs or extended 
cDNAs are expressed in the cell, tissue, organism, or other source of nucleic acids from 
which the tags were derived. In this way, the expression pattern of the 5' ESTs or 
extended cDNAs in the cell, tissue, organism, or other source of nucleic acids is obtained. 
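
The tag comparison step described above can be sketched in Python. This is only an illustration under assumptions the text does not specify: the anchoring enzyme site is taken to be `CATG`, the tag is taken to be the 10 bases immediately 3' of the 3'-most anchoring site, and the names `reference_tag` and `expression_profile` are hypothetical. Ditag splitting and sequencing-error handling are omitted.

```python
from collections import Counter

ANCHOR_SITE = "CATG"  # assumed anchoring enzyme recognition site
TAG_LENGTH = 10       # assumed tag length in bases


def reference_tag(cdna, anchor=ANCHOR_SITE, tag_len=TAG_LENGTH):
    """Return the tag next to the 3'-most anchoring site, or None."""
    pos = cdna.rfind(anchor)
    if pos == -1:
        return None
    start = pos + len(anchor)
    tag = cdna[start:start + tag_len]
    # Discard truncated tags that run off the end of the sequence.
    return tag if len(tag) == tag_len else None


def expression_profile(observed_tags, references):
    """Count observed tags per reference sequence (5' EST or extended cDNA)."""
    tag_to_id = {}
    for seq_id, cdna in references.items():
        tag = reference_tag(cdna)
        if tag is not None:
            # Keep the first sequence seen for a tag; ambiguous tags are not resolved here.
            tag_to_id.setdefault(tag, seq_id)
    counts = Counter()
    for tag in observed_tags:
        if tag in tag_to_id:
            counts[tag_to_id[tag]] += 1
    return dict(counts)
```

Tags that match no reference sequence are simply ignored in this sketch; in practice they might be retained for later identification.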

Quantitative analysis of gene expression may also be performed using arrays. As 
used herein, the term array means a one dimensional, two dimensional, or 
multidimensional arrangement of full length cDNAs (i.e. extended cDNAs which include
the coding sequence for the signal peptide, the coding sequence for the mature protein, and
a stop codon), extended cDNAs, 5' ESTs or fragments of the full length cDNAs, extended
cDNAs, or 5' ESTs of sufficient length to permit specific detection of gene expression.
Preferably, the fragments are at least 15 nucleotides in length. More preferably, the
fragments are at least 100 nucleotides in length. More preferably, the fragments are more
than 100 nucleotides in length. In some embodiments the fragments may be more than 
500 nucleotides in length. 

For example, quantitative analysis of gene expression may be performed with full
length cDNAs, extended cDNAs, 5' ESTs, or fragments thereof in a complementary DNA
microarray as described by Schena et al. (Science 270:467-470, 1995; Proc. Natl. Acad.
Sci. U.S.A. 93:10614-10619, 1996). Full length cDNAs, extended cDNAs, 5' ESTs or
fragments thereof are amplified by PCR and arrayed from 96-well microtiter plates onto
silylated microscope slides using high-speed robotics. Printed arrays are incubated in a
humid chamber to allow rehydration of the array elements and rinsed, once in 0.2% SDS
for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride solution.
The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1
min, rinsed twice with water, air dried and stored in the dark at 25°C.

Cell or tissue mRNA is isolated or commercially obtained and probes are prepared 
by a single round of reverse transcription. Probes are hybridized to 1 cm² microarrays
under a 14 x 14 mm glass coverslip for 6-12 hours at 60°C. Arrays are washed for 5 min
at 25°C in low stringency wash buffer (1 x SSC/0.2% SDS), then for 10 min at room
temperature in high stringency wash buffer (0.1 x SSC/0.2% SDS). Arrays are scanned in
0.1 x SSC using a fluorescence laser scanning device fitted with a custom filter set.
Accurate differential expression measurements are obtained by taking the average of the 
ratios of two independent hybridizations. 
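
The averaging of ratios from two independent hybridizations can be sketched as follows. This is a minimal illustration, assuming each hybridization is represented as a mapping from array element to a (test signal, reference signal) pair; the function names and the data layout are hypothetical and are not taken from the cited references.

```python
def expression_ratios(hybridization):
    """Map each array element to its test/reference signal ratio."""
    return {element: test / ref for element, (test, ref) in hybridization.items()}


def averaged_ratios(hyb_a, hyb_b):
    """Average the per-element ratios of two independent hybridizations.

    Only elements measured in both hybridizations are reported.
    """
    ratios_a = expression_ratios(hyb_a)
    ratios_b = expression_ratios(hyb_b)
    shared = ratios_a.keys() & ratios_b.keys()
    return {element: (ratios_a[element] + ratios_b[element]) / 2 for element in shared}
```

Background subtraction and normalization between channels are left out of this sketch.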

Quantitative analysis of the expression of genes may also be performed with full
length cDNAs, extended cDNAs, 5' ESTs, or fragments thereof in complementary DNA
arrays as described by Pietu et al. (Genome Research 6:492-503, 1996). The full length
cDNAs, extended cDNAs, 5' ESTs or fragments thereof are PCR amplified and spotted on
membranes. Then, mRNAs originating from various tissues or cells are labeled with
radioactive nucleotides. After hybridization and washing in controlled conditions, the
hybridized mRNAs are detected by phospho-imaging or autoradiography. Duplicate
experiments are performed and a quantitative analysis of differentially expressed mRNAs
is then performed.

Alternatively, expression analysis of the 5' ESTs or extended cDNAs can be done 
through high density nucleotide arrays as described by Lockhart et al. (Nature
Biotechnology 14:1675-1680, 1996) and Sosnowski et al. (Proc. Natl. Acad. Sci.
94:1119-1123, 1997). Oligonucleotides of 15-50 nucleotides corresponding to sequences
of the 5' ESTs or extended cDNAs are synthesized directly on the chip (Lockhart et al.,
supra) or synthesized and then addressed to the chip (Sosnowski et al., supra). Preferably,
the oligonucleotides are about 20 nucleotides in length.

cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin
or fluorescent dye, are synthesized from the appropriate mRNA population and then
randomly fragmented to an average size of 50 to 100 nucleotides. The probes are
then hybridized to the chip. After washing as described in Lockhart et al., supra, and
application of different electric fields (Sosnowski et al., Proc. Natl. Acad. Sci. 94:1119-
1123), the dyes or labeling compounds are detected and quantified. Duplicate
hybridizations are performed. Comparative analysis of the intensity of the signal
originating from cDNA probes on the same target oligonucleotide in different cDNA
samples indicates a differential expression of the mRNA corresponding to the 5' EST or
extended cDNA from which the oligonucleotide sequence has been designed.

III. Use of 5' ESTs to Clone Extended cDNAs and to Clone the Corresponding 
Genomic DNAs 

Once 5' ESTs which include the 5' end of the corresponding mRNAs have been 
selected using the procedures described above, they can be utilized to isolate extended
cDNAs which contain sequences adjacent to the 5' ESTs. The extended cDNAs may
include the entire coding sequence of the protein encoded by the corresponding mRNA,
including the authentic translation start site, the signal sequence, and the sequence
encoding the mature protein remaining after cleavage of the signal peptide. Such
extended cDNAs are referred to herein as "full length cDNAs." Alternatively, the
extended cDNAs may include only the sequence encoding the mature protein remaining
after cleavage of the signal peptide, or only the sequence encoding the signal peptide. 

Example 27 below describes a general method for obtaining extended cDNAs. 
Example 28 below describes the cloning and sequencing of several extended cDNAs, 
including extended cDNAs which include the entire coding sequence and authentic 5' end
of the corresponding mRNA for several secreted proteins.

The methods of Examples 27, 28, and 29 can also be used to obtain extended 
cDNAs which encode less than the entire coding sequence of the secreted proteins 
encoded by the genes corresponding to the 5' ESTs. In some embodiments, the extended 
cDNAs isolated using these methods encode at least 10 amino acids of one of the proteins
encoded by the sequences of SEQ ID NOs: 40-140 and 242-377. In further embodiments,
the extended cDNAs encode at least 20 amino acids of the proteins encoded by the
sequences of SEQ ID NOs: 40-140 and 242-377. In further embodiments, the extended
cDNAs encode at least 30 amino acids of the sequences of SEQ ID NOs: 40-140
and 242-377. In a preferred embodiment, the extended cDNAs encode a full length
protein sequence, which includes the protein coding sequences of SEQ ID NOs: 40-140 
and 242-377. 

EXAMPLE 27 

General Method for Using 5' ESTs to Clone and Sequence Extended cDNAs

The following general method has been used to quickly and efficiently isolate
extended cDNAs including sequence adjacent to the sequences of the 5' ESTs used to
obtain them. This method may be applied to obtain extended cDNAs for any 5' EST in 
the NETGENE™ database, including those 5' ESTs encoding secreted proteins. The 
method is summarized in Figure 6. 

1. Obtaining Extended cDNAs

a) First strand synthesis 

The method takes advantage of the known 5' sequence of the mRNA. A reverse 
transcription reaction is conducted on purified mRNA with a poly 14dT primer containing
a 49 nucleotide sequence at its 5' end allowing the addition of a known sequence at the end
of the cDNA which corresponds to the 3' end of the mRNA. For example, the primer may
have the following sequence: 5'-ATC GTT GAG ACT CGT ACC AGC AGA GTC ACG
AGA GAG ACT ACA CGG TAC TGG TTT TTT TTT TTT TTVN -3' (SEQ ID NO: 14).
Those skilled in the art will appreciate that other sequences may also be added to the poly
dT sequence and used to prime the first strand synthesis. Using this primer and a reverse
transcriptase such as the Superscript II (Gibco BRL) or RNase H Minus M-MLV
(Promega) enzyme, a reverse transcript anchored at the 3' polyA site of the RNAs is
generated. 

After removal of the mRNA hybridized to the first cDNA strand by alkaline 
hydrolysis, the products of the alkaline hydrolysis and the residual poly dT primer are
eliminated with an exclusion column such as an AcA34 (Biosepra) matrix as explained in
Example 11. 

b) Second strand synthesis 

A pair of nested primers on each end is designed based on the known 5' sequence 
from the 5' EST and the known 3' end added by the poly dT primer used in the first strand 
synthesis. Software used to design primers is either based on GC content and melting
temperatures of oligonucleotides, such as OSP (Hillier and Green, PCR Meth. Appl. 1:124-
128, 1991), or based on the octamer frequency disparity method (Griffais et al., Nucleic
Acids Res. 19:3887-3891, 1991), such as PC-Rare
(http://bioinformatics.weizmann.ac.il/software/PC-Rare/doc/manuel.html).

Preferably, the nested primers at the 5' end are separated from one another by four 
to nine bases. The 5' primer sequences may be selected to have melting temperatures and 
specificities suitable for use in PCR. 

Preferably, the nested primers at the 3' end are separated from one another by four
to nine bases. For example, the nested 3' primers may have the following sequences: 5'-
CCA GCA GAG TCA CGA GAG AGA CTA CAC GG -3' (SEQ ID NO: 15), and 5'-
CAC GAG AGA GAC TAC ACG GTA CTG G -3' (SEQ ID NO: 16). These primers
were selected because they have melting temperatures and specificities compatible with 
their use in PCR. However, those skilled in the art will appreciate that other sequences 
may also be used as primers. 
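
As an illustrative check only, the following Python sketch locates the two example 3' primers (SEQ ID NOs: 15 and 16) within the 5' portion of the first strand primer (SEQ ID NO: 14), which is the known sequence added during first strand synthesis. The helper names are hypothetical, and the sketch does not evaluate melting temperature or specificity.

```python
# Sequences copied from the text; spaces are removed before comparison.
FIRST_STRAND_PRIMER = (
    "ATC GTT GAG ACT CGT ACC AGC AGA GTC ACG "
    "AGA GAG ACT ACA CGG TAC TGG TTT TTT TTT TTT TTVN"
)
OUTER_3_PRIMER = "CCA GCA GAG TCA CGA GAG AGA CTA CAC GG"  # SEQ ID NO: 15
INNER_3_PRIMER = "CAC GAG AGA GAC TAC ACG GTA CTG G"       # SEQ ID NO: 16


def clean(sequence):
    """Strip spacing and normalize case."""
    return sequence.replace(" ", "").upper()


def primer_offset(primer, template):
    """Return the 0-based position of the primer in the template, or -1 if absent."""
    return clean(template).find(clean(primer))


outer_start = primer_offset(OUTER_3_PRIMER, FIRST_STRAND_PRIMER)
inner_start = primer_offset(INNER_3_PRIMER, FIRST_STRAND_PRIMER)
print("outer primer starts at", outer_start)
print("inner primer starts at", inner_start)
```

A result of -1 for either primer would indicate that it does not lie within the added sequence.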

The first PCR run of 25 cycles is performed using the Advantage Tth Polymerase 
Mix (Clontech) and the outer primer from each of the nested pairs. A second 20 cycle 
PCR using the same enzyme and the inner primer from each of the nested pairs is then 
performed on 1/2500 of the first PCR product. Thereafter, the primers and nucleotides are 
removed. 

2. Sequencing of Full Length Extended cDNAs or Fragments Thereof 

Due to the lack of position constraints on the design of 5' nested primers
compatible for PCR use using the OSP software, amplicons of two types are obtained.
Preferably, the second 5' primer is located upstream of the translation initiation codon
thus yielding a nested PCR product containing the whole coding sequence. Such a full
length extended cDNA undergoes a direct cloning procedure as described in section a
below. However, in some cases, the second 5' primer is located downstream of the
translation initiation codon, thereby yielding a PCR product containing only part of the
ORF. Such incomplete PCR products are submitted to a modified procedure described in
section b below. 

a) Nested PCR products containing complete ORFs

When the resulting nested PCR product contains the complete coding sequence,
as predicted from the 5' EST sequence, it is cloned in an appropriate vector such as
pED6dpc2, as described in section 3.

b) Nested PCR products containing incomplete ORFs

When the amplicon does not contain the complete coding sequence, intermediate
steps are necessary to obtain both the complete coding sequence and a PCR product
containing the full coding sequence. The complete coding sequence can be assembled
from several partial sequences determined directly from different PCR products as
described in the following section.

Once the full coding sequence has been completely determined, new primers
compatible for PCR use are designed to obtain amplicons containing the whole coding
region. However, in such cases, 3' primers compatible for PCR use are located inside
the 3' UTR of the corresponding mRNA, thus yielding amplicons which lack part of
this region, i.e. the polyA tract and sometimes the polyadenylation signal, as illustrated
in Figure 6. Such full length extended cDNAs are then cloned into an appropriate vector
as described in section 3. 

c) Sequencing extended cDNAs 

Sequencing of extended cDNAs is performed using a Dye Terminator approach
with the AmpliTaq DNA polymerase FS kit available from Perkin Elmer.

In order to sequence PCR fragments, primer walking is performed using software
such as OSP to choose primers and automated computer software such as ASMG (Sutton
et al., Genome Science Technol. 1:9-19, 1995) to construct contigs of walking sequences
including the initial 5' tag using minimum overlaps of 32 nucleotides. Preferably, primer
walking is performed until the sequences of full length cDNAs are obtained. 

Completion of the sequencing of a given extended cDNA fragment is assessed as 
follows. Since sequences located after a polyA tract are difficult to determine precisely in 
the case of uncloned products, sequencing and primer walking processes for PCR products
are interrupted when a polyA tract is identified in extended cDNAs obtained as described
in case b. The sequence length is compared to the size of the nested PCR product
obtained as described above. Due to the limited accuracy of the determination of the PCR
product size by gel electrophoresis, a sequence is considered complete if the size of the
obtained sequence is at least 70% the size of the first nested PCR product. If the length of
the sequence determined from the computer analysis is not at least 70% of the length of
the nested PCR product, these PCR products are cloned and the sequence of the insertion
is determined. When Northern blot data are available, the size of the mRNA detected for a
given PCR product is used to finally assess that the sequence is complete. Sequences
which do not fulfill the above criteria are discarded and will undergo a new isolation
procedure.
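
The 70% completeness criterion can be expressed as a short Python sketch. The function names and the returned labels are hypothetical; integer arithmetic is used so that the boundary case (exactly 70%) is handled without floating-point rounding.

```python
def sequence_is_complete(sequence_len, pcr_product_size, min_percent=70):
    """True if the assembled sequence is at least min_percent of the gel-estimated size."""
    return sequence_len * 100 >= min_percent * pcr_product_size


def next_step(sequence_len, pcr_product_size):
    """Suggest the follow-up described in the text for an uncloned PCR product."""
    if sequence_is_complete(sequence_len, pcr_product_size):
        return "complete"
    # Below the threshold, the PCR product is cloned and the insert is sequenced.
    return "clone and sequence insert"
```

The optional Northern blot comparison mentioned above is not modeled here.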

Sequence data of all extended cDNAs are then transferred to a proprietary 
database, where quality controls and validation steps are carried out as described in 
Example 15.

3. Cloning of Full Length Extended cDNAs

The PCR product containing the full coding sequence is then cloned in an
appropriate vector. For example, the extended cDNAs can be cloned into the expression
vector pED6dpc2 (DiscoverEase, Genetics Institute, Cambridge, MA) as follows. The
structure of pED6dpc2 is shown in Figure 7. pED6dpc2 vector DNA is prepared with
blunt ends by performing an EcoRI digestion followed by a fill in reaction. The blunt
ended vector is dephosphorylated. After removal of PCR primers and ethanol
precipitation, the PCR product containing the full coding sequence or the extended cDNA
obtained as described above is phosphorylated with a kinase subsequently removed by
phenol-Sevag extraction and precipitation. The double stranded extended cDNA is then
ligated to the vector and the resulting expression plasmid introduced into appropriate host
cells. 

Since the PCR products obtained as described above are blunt ended molecules
that can be cloned in either direction, the orientation of several clones for each PCR
product is determined. Then, 4 to 10 clones are ordered in microtiter plates and subjected
to a PCR reaction using a first primer located in the vector close to the cloning site and a
second primer located in the portion of the extended cDNA corresponding to the 3' end of
the mRNA. This second primer may be the antisense primer used in anchored PCR in the
case of direct cloning (case a) or the antisense primer located inside the 3' UTR in the case
of indirect cloning (case b). Clones in which the start codon of the extended cDNA is
operably linked to the promoter in the vector so as to permit expression of the protein 
encoded by the extended cDNA are conserved and sequenced. In addition to the ends of 
cDNA inserts, approximately 50 bp of vector DNA on each side of the cDNA insert are 
also sequenced. 

The cloned PCR products are then entirely sequenced according to the
aforementioned procedure. In this case, contig assembly of long fragments is then
performed on walking sequences that have already been contigated for uncloned PCR products
during primer walking. Sequencing of cloned amplicons is complete when the resulting
contigs include the whole coding region as well as overlapping sequences with vector 
DNA on both ends. 

4. Computer Analysis of Full Length Extended cDNA

Sequences of all full length extended cDNAs are then submitted to further 
analysis as described below and using the parameters found in Table II with the 
following modifications. For screening of miscellaneous subdivisions of Genbank, 
FASTA was used instead of BLASTN and 15 nucleotides of homology was the limit
instead of 17. For Alu detection, BLASTN was used with the following parameters:
S=72; identity=70%; and length = 40 nucleotides. The polyadenylation signal and polyA
tail, which were not searched for in the 5' ESTs, were searched. For polyadenylation signal
detection the signal (AATAAA) was searched with one permissible mismatch in the last
ten nucleotides preceding the 5' end of the poly A. For the poly A, a stretch of 8
nucleotides in the last 20 nucleotides of the sequence was searched with BLAST2N in the
sense strand with the following parameters (W=6, S=10, E=1000, and identity=90%).
Finally, patented sequences and ORF homologies were searched using, respectively, 
BLASTN and BLASTP on GenSEQ (Derwent's database of patented nucleotide 
sequences) and SWISSPROT for ORFs with the following parameters (W=8 and 
B=10). Before examining the extended full length cDNAs for sequences of interest, 
extended cDNAs which are not of interest are searched as follows.

a) Elimination of undesired sequences

Although 5' ESTs were checked to remove contaminant sequences as described in
Example 18, a last verification was carried out to identify extended cDNA sequences
derived from undesired sequences such as vector RNAs, transfer RNAs, ribosomal
RNAs, mitochondrial RNAs, prokaryotic RNAs and fungal RNAs using the FASTA
and BLASTN programs on both strands of extended cDNAs as described below. 

To identify the extended cDNAs encoding vector RNAs, extended cDNAs are 
compared to the known sequences of vector RNA using the FASTA program. 
Sequences of extended cDNAs with more than 90% homology over stretches of 15 
nucleotides are identified as vector RNA. 

To identify the extended cDNAs encoding tRNAs, extended cDNA sequences 
were compared to the sequences of 1190 known tRNAs obtained from EMBL release 38,
of which 100 were human. Sequences of extended cDNAs having more than 80%
homology over 60 nucleotides using FASTA were identified as tRNA. 

To identify the extended cDNAs encoding rRNAs, extended cDNA sequences 
were compared to the sequences of 2497 known rRNAs obtained from EMBL release 38,
of which 73 were human. Sequences of extended cDNAs having more than 80% 
homology over stretches longer than 40 nucleotides using BLASTN were identified as 
rRNAs. 

To identify the extended cDNAs encoding mtRNAs, extended cDNA sequences 
were compared to the sequences of the two known mitochondrial genomes for which the
entire genomic sequences are available and all sequences transcribed from these
mitochondrial genomes including tRNAs, rRNAs, and mRNAs for a total of 38
sequences. Sequences of extended cDNAs having more than 80% homology over
stretches longer than 40 nucleotides using BLASTN were identified as mtRNAs.

Sequences which might have resulted from other exogenous contaminants were
identified by comparing extended cDNA sequences to release 105 of Genbank bacterial
and fungal divisions. Sequences of extended cDNAs having more than 90% homology
over 40 nucleotides using BLASTN were identified as exogenous prokaryotic or fungal
contaminants.
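
The contaminant thresholds given above can be collected into a small lookup table, sketched here in Python. The table layout and the name `is_contaminant` are hypothetical. As an assumption, "over N nucleotides" is read as a match length of at least N, while "longer than N nucleotides" is read as strictly greater than N; the alignment itself (FASTA or BLASTN) is presumed to have been run elsewhere.

```python
# category: (program named in the text, identity must exceed this %, length threshold,
#            True if the match must be strictly longer than the threshold)
CONTAMINANT_RULES = {
    "vector RNA": ("FASTA", 90, 15, False),
    "tRNA": ("FASTA", 80, 60, False),
    "rRNA": ("BLASTN", 80, 40, True),
    "mtRNA": ("BLASTN", 80, 40, True),
    "prokaryotic/fungal": ("BLASTN", 90, 40, False),
}


def is_contaminant(category, identity_pct, match_len):
    """Apply the identity and length thresholds for one contaminant category."""
    _program, min_identity, min_len, strict = CONTAMINANT_RULES[category]
    long_enough = match_len > min_len if strict else match_len >= min_len
    return identity_pct > min_identity and long_enough
```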

In addition, extended cDNAs were searched for different repeat sequences, 
including Alu sequences, L1 sequences, THE and MER repeats, SSTR sequences or
satellite, micro-satellite, or telomeric repeats. Sequences of extended cDNAs with more
than 70% homology over 40 nucleotide stretches using BLASTN were identified as
repeat sequences and masked in further identification procedures. In addition, clones
showing extensive homology to repeats, i.e., matches of either more than 50
nucleotides if the homology was at least 75% or more than 40 nucleotides if the
homology was at least 85% or more than 30 nucleotides if the homology was at least
90%, were flagged.
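
The three-tier rule for flagging extensive repeat homology can be sketched in Python as follows. The names are hypothetical, and the identity and match length are assumed to come from a BLASTN comparison against a repeat collection performed elsewhere.

```python
# (minimum % identity, match length that must be exceeded), as listed in the text
EXTENSIVE_REPEAT_RULES = [(75, 50), (85, 40), (90, 30)]


def has_extensive_repeat_homology(identity_pct, match_len):
    """True if any identity/length tier for extensive repeat homology is met."""
    return any(
        identity_pct >= min_identity and match_len > min_len
        for min_identity, min_len in EXTENSIVE_REPEAT_RULES
    )
```

A shorter match is tolerated only when identity is higher, which is why the tiers are checked independently rather than as a single cutoff.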

b) Identification of structural features

Structural features, e.g. polyA tail and polyadenylation signal, of the sequences 
of full length extended cDNAs are subsequently determined as follows. 

A polyA tail is defined as a homopolymeric stretch of at least 11 A with at most
one alternative base within it. The polyA tail search is restricted to the last 100 nt of the
sequence and limited to stretches of 11 consecutive A's because sequencing reactions
are often not readable after such a polyA stretch. Stretches having more than 90%
homology over 8 nucleotides are identified as polyA tails using BLAST2N.

A polyA tail is defined as a homopolymeric stretch of at least 11 A with at most
one alternative base within it. The polyA tail search is restricted to the last 20 nt of the
sequence and limited to stretches of 11 consecutive A's because sequencing reactions
are often not readable after such a polyA stretch. Stretches with 100% homology over 6
nucleotides are identified as polyA tails. 

To search for a polyadenylation signal, the polyA tail is clipped from the full- 
length sequence. The 50 bp preceding the polyA tail are searched for the canonic 
polyadenylation AAUAAA signal allowing one mismatch to account for possible 
sequencing errors and known variation in the canonical sequence of the polyadenylation 
signal. 
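
The polyadenylation signal search can be illustrated with the following Python sketch. It is a simplification under stated assumptions: the polyA tail is located as an uninterrupted run of at least 11 A's in the last 100 nt (the allowance for one alternative base and the BLAST2N step are not modeled), the signal is written in its cDNA form AATAAA, and all function names are hypothetical.

```python
import re

CANONICAL_SIGNAL = "AATAAA"  # cDNA form of the AAUAAA signal


def find_polya_tail(seq, last_nt=100, min_run=11):
    """Return the start of the first run of >= min_run A's in the last last_nt bases."""
    offset = max(0, len(seq) - last_nt)
    match = re.search("A{%d,}" % min_run, seq[offset:])
    return None if match is None else offset + match.start()


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_polya_signals(seq, tail_start, window=50, max_mismatch=1):
    """List (position, hexamer, mismatches) hits in the window preceding the tail."""
    k = len(CANONICAL_SIGNAL)
    region_start = max(0, tail_start - window)
    hits = []
    # Hexamers must end before the tail, i.e. the tail is clipped from the search.
    for i in range(region_start, tail_start - k + 1):
        hexamer = seq[i:i + k]
        mismatches = hamming(hexamer, CANONICAL_SIGNAL)
        if mismatches <= max_mismatch:
            hits.append((i, hexamer, mismatches))
    return hits
```

Allowing one mismatch also admits known variants such as ATTAAA, at the cost of more chance matches in A-rich 3' regions.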

c) Identification of functional features

Functional features, e.g. ORFs and signal sequences, of the sequences of full
length extended cDNAs were subsequently determined as follows.

The 3 upper strand frames of extended cDNAs are searched for ORFs defined as
the maximum length fragments beginning with a translation initiation codon and ending
with a stop codon. ORFs encoding at least 20 amino acids are preferred.
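
The ORF definition above can be sketched in Python as a scan of the three forward frames. This is a minimal illustration, assuming ATG as the only initiation codon and the standard stop codons; the function name and the tuple layout of the result are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODON = "ATG"


def find_orfs(seq, min_aa=20):
    """Return (frame, start, end, n_aa) for ORFs in the three upper-strand frames.

    Each ORF runs from the first ATG after the previous in-frame stop to the
    next in-frame stop, which gives the maximum-length fragment. `end` is the
    index just past the stop codon; n_aa counts residues, excluding the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if codon == START_CODON and start is None:
                start = i
            elif codon in STOP_CODONS:
                if start is not None:
                    n_aa = (i - start) // 3
                    if n_aa >= min_aa:
                        orfs.append((frame, start, i + 3, n_aa))
                start = None
    return orfs
```

ORFs that run off the 3' end of the sequence without a stop codon are not reported by this sketch.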

Each found ORF is then scanned for the presence of a signal peptide in the first 50 
amino-acids or, where appropriate, within shorter regions down to 20 amino acids or less 
in the ORF, using the matrix method of von Heijne (Nuc. Acids Res. 14: 4683-4690 
(1986)), the disclosure of which is incorporated herein by reference and the modification 
described in Example 22. 

d) Homology to either nucleotidic or proteic sequences 

Sequences of full length extended cDNAs are then compared to known sequences
on a nucleotidic or proteic basis.

Sequences of full length extended cDNAs are compared to the following known
nucleic acid sequences: vertebrate sequences, EST sequences, patented sequences and
recently identified sequences available at the time of filing the priority documents. Full
length cDNA sequences are also compared to the sequences of a private database (Genset
internal sequences) in order to find sequences that have already been identified by
applicants. Sequences of full length extended cDNAs with more than 90% homology
over 30 nucleotides using either BLASTN or BLAST2N as indicated in Table III are
identified as sequences that have already been described. Matching vertebrate sequences
are subsequently examined using FASTA; full length extended cDNAs with more than
70% homology over 30 nucleotides are identified as sequences that have already been
described.

ORFs encoded by full length extended cDNAs as defined in section c) are 
subsequently compared to known amino acid sequences found in Swissprot release 
CHP, PIR release PIR# and Genpept release GPEPT public databases using BLASTP 
with the parameter W=8 and allowing a maximum of 10 matches. Sequences of full 
length extended cDNAs showing extensive homology to known protein sequences are 
recognized as already identified proteins. 

In addition, the three-frame conceptual translation products of the top strand of
full length extended cDNAs are compared to publicly known amino acid sequences of 
Swissprot using BLASTX with the parameter E=0.001. Sequences of full length 
extended cDNAs with more than 70% homology over 30 amino acid stretches are 
detected as already identified proteins. 

5. Selection of Cloned Full Length Sequences of the Present Invention 

Cloned full length extended cDNA sequences that have already been
characterized by the aforementioned computer analysis are then submitted to an
automatic procedure in order to preselect full length extended cDNAs containing
sequences of interest.

a) Automatic sequence preselection

All complete cloned full length extended cDNAs clipped for vector on both ends 
are considered. First, a negative selection is operated in order to eliminate unwanted 
cloned sequences resulting from either contaminants or PCR artifacts as follows. 
Sequences matching contaminant sequences such as vector RNA, tRNA, mtRNA, rRNA 
sequences are discarded as well as those encoding ORF sequences exhibiting extensive 
homology to repeats as defined in section 4 a). Sequences obtained by direct cloning 
using nested primers on 5' and 3' tags (section 1. case a) but lacking polyA tail are 
discarded. Only ORFs containing a signal peptide and ending either before the polyA tail 
(case a) or before the end of the cloned 3'UTR (case b) are kept. Then, ORFs containing
unlikely mature proteins such as mature proteins whose size is less than 20 amino acids or
less than 25% of the immature protein size are eliminated.
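
The mature protein size filter can be written as a short Python sketch. The function name is hypothetical, and the mature protein length is assumed to be the ORF length minus the predicted signal peptide length. Integer arithmetic avoids rounding at the 25% boundary.

```python
def mature_protein_is_plausible(orf_aa_len, signal_peptide_aa_len,
                                min_mature_aa=20, min_percent=25):
    """Keep ORFs whose mature protein is neither too short nor too small a fraction.

    An ORF is rejected if the mature protein has fewer than min_mature_aa
    residues or is less than min_percent of the immature protein size.
    """
    mature_len = orf_aa_len - signal_peptide_aa_len
    return (mature_len >= min_mature_aa
            and mature_len * 100 >= min_percent * orf_aa_len)
```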

In the selection of the ORF, priority was given to the ORF and the frame
corresponding to the polypeptides described in SignalTag Patents (United States Patent
Application Serial Nos: 08/905,223; 08/905,135; 08/905,051; 08/905,144; 08/905,279;
08/904,468; 08/905,134; and 08/905,133). If the ORF was not found among the ORFs
described in the SignalTag Patents, the ORF encoding the signal peptide with the highest
score according to the von Heijne method as defined in Example 22 was chosen. If the scores
were identical, then the longest ORF was chosen. 
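
The ORF priority rule can be sketched in Python as follows. The record type, field names, and function name are hypothetical. One assumption goes beyond the text: if several candidate ORFs appear among the previously described ones, the same score-then-length ordering is applied to choose between them.

```python
from typing import NamedTuple


class OrfCandidate(NamedTuple):
    orf_id: str
    von_heijne_score: float
    length_aa: int


def select_orf(candidates, previously_described_ids):
    """Pick one ORF: previously described ORFs first, then score, then length."""
    if not candidates:
        return None
    described = [c for c in candidates if c.orf_id in previously_described_ids]
    pool = described if described else list(candidates)
    # Highest signal peptide score wins; ties are broken by the longest ORF.
    return max(pool, key=lambda c: (c.von_heijne_score, c.length_aa))
```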

Sequences of full length extended cDNA clones are then compared pairwise with 
BLAST after masking of the repeat sequences. Sequences containing at least 90% 
homology over 30 nucleotides are clustered in the same class. Each cluster is then 
subjected to a cluster analysis that detects sequences resulting from internal priming or 
from alternative splicing, identical sequences or sequences with several frameshifts. This
automatic analysis serves as a basis for manual selection of the sequences.
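
The grouping of clones into classes can be sketched in Python with a union-find structure. This is an illustration only: the pairwise comparison results are assumed to be supplied as (clone, clone, percent identity, match length) tuples from a BLAST run performed elsewhere, the thresholds are read as inclusive, and the function name is hypothetical. The subsequent cluster analysis (internal priming, alternative splicing, frameshifts) is not modeled.

```python
def cluster_clones(clone_ids, hits, min_identity=90.0, min_length=30):
    """Group clones linked by hits meeting the identity and length thresholds."""
    parent = {clone: clone for clone in clone_ids}

    def find(x):
        # Follow parent links to the class representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, identity, length in hits:
        if identity >= min_identity and length >= min_length:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    classes = {}
    for clone in clone_ids:
        classes.setdefault(find(clone), []).append(clone)
    return sorted(sorted(members) for members in classes.values())
```

Because membership is transitive, two clones can end up in the same class without matching each other directly, provided each matches a common third clone.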

b) Manual sequence selection

Manual selection is carried out using automatically generated reports for each 
sequenced full length extended cDNA clone. During this manual procedure, a
selection is made between clones belonging to the same class as follows. ORF
sequences encoded by clones belonging to the same class are aligned and compared. If
the homology between nucleotidic sequences of clones belonging to the same class is
more than 90% over 30 nucleotide stretches or if the homology between amino acid
sequences of clones belonging to the same class is more than 80% over 20 amino acid
stretches, then the clones are considered as being identical. The chosen ORF is the best
one according to the criteria mentioned below. If the nucleotide and amino acid 
homologies are less than 90% and 80% respectively, the clones are said to encode 
distinct proteins which can be both selected if they contain sequences of interest. 

Selection of full length extended cDNA clones encoding sequences of interest is 
performed using the following criteria. Structural parameters (initial tag, polyadenylation
site and signal) are first checked. Then, homologies with known nucleic acids and
proteins are examined in order to determine whether the clone sequence matches a known
nucleic/proteic sequence and, in the latter case, its covering rate and the date at which the
sequence became public. If there is no extensive match with sequences other than ESTs
or genomic DNA, or if the clone sequence brings substantial new information, such as
encoding a protein resulting from alternative splicing of an mRNA coding for an already
known protein, the sequence is kept. Examples of such cloned full length extended
cDNAs containing sequences of interest are described in Example 28. Sequences
resulting from chimera or double inserts as assessed by homology to other sequences are
discarded during this procedure.

EXAMPLE 28 

Cloning and Sequencing of Extended cDNAs 
The procedure described in Example 27 above was used to obtain the extended 
cDNAs of the present invention. Using this approach, the full length cDNA of SEQ ID
NO: 17 was obtained. This cDNA falls into the "EST-ext" category described above and
encodes the signal peptide MKKVLLLITAILAVAVG (SEQ ID NO: 18) having a von
Heijne score of 8.2.

The full length cDNA of SEQ ID NO: 19 was also obtained using this procedure.
This cDNA falls into the "EST-ext" category described above and encodes the signal
peptide MWWFQQGLSFLPSALVIWTSA (SEQ ID NO:20) having a von Heijne score
of 5.5.

Another full length cDNA obtained using the procedure described above has the 
sequence of SEQ ID NO:21. This cDNA falls into the "EST-ext" category described
above and encodes the signal peptide MVLTTLPSANSANSPVNMPTTGPNSLSYASSALSPCLT
(SEQ ID NO:22) having a von Heijne score of 5.9.

The above procedure was also used to obtain a full length cDNA having the 
sequence of SEQ ID NO:23. This cDNA falls into the "EST-ext" category described 
above and encodes the signal peptide ILSTVTALTFAXA (SEQ ID NO:24) having a von 
Heijne score of 5.5.

The full length cDNA of SEQ ID NO:25 was also obtained using this procedure.
This cDNA falls into the "new" category described above and encodes a signal peptide
LVLTLCTLPLAVA (SEQ ID NO:26) having a von Heijne score of 10.1.

The full length cDNA of SEQ ID NO:27 was also obtained using this procedure. 
This cDNA falls into the "new" category described above and encodes a signal peptide
LWLLFFLVTAIHA (SEQ ID NO:28) having a von Heijne score of 10.7.

The above procedures were also used to obtain the extended cDNAs of the present 
invention. 5' ESTs expressed in a variety of tissues were obtained as described above. 
The appended sequence listing provides the tissues from which the extended cDNAs were 
obtained. It will be appreciated that the extended cDNAs may also be expressed in tissues
other than the tissue listed in the sequence listing.

5' ESTs obtained as described above were used to obtain extended cDNAs having 
the sequences of SEQ ID NOs: 40-140 and 242-377. Table IV provides the sequence 
identification numbers of the extended cDNAs of the present invention, the locations of 
the full coding sequences in SEQ ID NOs: 40-140 and 242-377 (i.e. the nucleotides
encoding both the signal peptide and the mature protein, listed under the heading FCS 
location in Table IV), the locations of the nucleotides in SEQ ID NOs: 40-140 and 242- 
377 which encode the signal peptides (listed under the heading SigPep Location in Table 
IV), the locations of the nucleotides in SEQ ID NOs: 40-140 and 242-377 which encode 
the mature proteins generated by cleavage of the signal peptides (listed under the heading 
Mature Polypeptide Location in Table IV), the locations in SEQ ID NOs: 40-140 and 242- 
377 of stop codons (listed under the heading Stop Codon Location in Table IV), the 
locations in SEQ ID NOs: 40-140 and 242-377 of polyA signals (listed under the heading 
Poly A Signal Location in Table IV) and the locations of polyA sites (listed under the 
heading Poly A Site Location in Table IV). 

The polypeptides encoded by the extended cDNAs were screened for the presence 
of known structural or functional motifs or for the presence of signatures, small amino 
acid sequences which are well conserved amongst the members of a protein family. The 
conserved regions have been used to derive consensus patterns or matrices included in the 
PROSITE data bank, in particular in the file prosite.dat (Release 13.0 of November 1995, 
located at http://expasy.hcuge.ch/sprot/prosite.html). Prosite_convert and prosite_scan 
programs (http://ulrec3.unil.ch/ftpserveur/prosite_scan) were used to find signatures on 
the extended cDNAs. 

For each pattern obtained with the prosite_convert program from the prosite.dat 
file, the accuracy of the detection on a new protein sequence has been tested by evaluating 
the frequency of irrelevant hits on the population of human secreted proteins included in 
the data bank SWISSPROT. The ratio between the number of hits on shuffled proteins 
(with a window size of 20 amino acids) and the number of hits on native (unshuffled) 
proteins was used as an index. Every pattern for which the ratio was greater than 20% 
(one hit on shuffled proteins for 5 hits on native proteins) was skipped during the search 
with prosite_scan. The program used to shuffle protein sequences (db_shuffled) and the 
program used to determine the statistics for each pattern in the protein data banks 
(prosite_statistics) are available on the ftp site http://ulrec3.unil.ch/ftpserveur/prosite_scan. 
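
By way of illustration only, the pattern filtering described above may be sketched as follows. The sketch assumes that each signature has already been converted to a regular expression and that shuffling is performed within consecutive windows of 20 amino acids. The function names, the regular expression representation, and the shuffling scheme are hypothetical and do not reproduce the db_shuffled, prosite_statistics, or prosite_scan programs. 

```python
import random
import re


def shuffle_in_windows(seq, window=20, rng=None):
    """Shuffle residues inside consecutive windows of `window` amino acids.

    Assumption: this is one plausible reading of "a window size of 20";
    it preserves the local amino acid composition of the protein.
    """
    rng = rng or random.Random(0)
    pieces = []
    for start in range(0, len(seq), window):
        chunk = list(seq[start:start + window])
        rng.shuffle(chunk)
        pieces.append("".join(chunk))
    return "".join(pieces)


def count_hits(pattern, sequences):
    """Count the sequences in which the pattern occurs at least once."""
    regex = re.compile(pattern)
    return sum(1 for seq in sequences if regex.search(seq))


def shuffled_to_native_ratio(pattern, native, window=20, seed=0):
    """Hits on shuffled proteins divided by hits on native proteins.

    Returns None when the pattern never hits a native protein.
    """
    rng = random.Random(seed)
    shuffled = [shuffle_in_windows(seq, window, rng) for seq in native]
    native_hits = count_hits(pattern, native)
    if native_hits == 0:
        return None
    return count_hits(pattern, shuffled) / native_hits


def keep_pattern(pattern, native, threshold=0.20, window=20, seed=0):
    """Keep a pattern only if its ratio does not exceed the threshold."""
    ratio = shuffled_to_native_ratio(pattern, native, window, seed)
    return ratio is not None and ratio <= threshold
```

A pattern for which `keep_pattern` returns False corresponds to a pattern that would be skipped during the search. 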

Table V lists the sequence identification numbers of the polypeptides of SEQ ID 
NOs: 141-241 and 378-513, the locations of the amino acid residues of SEQ ID NOs: 
141-241 and 378-513 in the full length polypeptide (second column), the locations of the 
amino acid residues of SEQ ID NOs: 141-241 and 378-513 in the signal peptides (third 
column), and the locations of the amino acid residues of SEQ ID NOs: 141-241 and 
378-513 in the mature polypeptide created by cleaving the signal peptide from the full length 
polypeptide (fourth column). 

The nucleotide sequences of the sequences of SEQ ID NOs: 40-140 and 242-377 
and the amino acid sequences encoded by SEQ ID NOs: 40-140 and 242-377 (i.e. amino 
acid sequences of SEQ ID NOs: 141-241 and 378-513) are provided in the appended 
sequence listing. In some instances, the sequences are preliminary and may include some 
incorrect or ambiguous sequences or amino acids. The sequences of SEQ ID NOs: 40-140 
and 242-377 can readily be screened for any errors therein and any sequence ambiguities 
can be resolved by resequencing a fragment containing such errors or ambiguities on both 
strands. Nucleic acid fragments for resolving sequencing errors or ambiguities may be 
obtained from the deposited clones or can be isolated using the techniques described 
herein. Resolution of any such ambiguities or errors may be facilitated by using primers 
which hybridize to sequences located close to the ambiguous or erroneous sequences. For 
example, the primers may hybridize to sequences within 50-75 bases of the ambiguity or 
5 error. Upon resolution of an error or ambiguity, the corresponding corrections can be 
made in the protein sequences encoded by the DNA containing the error or ambiguity. 
For example, in the sequences of the present invention, ambiguities in the sequence of 
SEQ ID NO: 131 were resolved. The amino acid sequence of the protein encoded by a 
particular clone can also be determined by expression of the clone in a suitable host cell, 
collecting the protein, and determining its sequence. 

For each amino acid sequence, Applicants have identified what they have 
determined to be the reading frame best identifiable with sequence information available 
at the time of filing. Some of the amino acid sequences may contain "Xaa" designators. 
These "Xaa" designators indicate either (1) a residue which cannot be identified because 
of nucleotide sequence ambiguity or (2) a stop codon in the determined sequence where 
Applicants believe one should not exist (if the sequence were determined more 
accurately). 

Cells containing the extended cDNAs (SEQ ID NOs: 40-140 and 242-377) of the 
present invention in the vector pED6dpc2 are maintained in permanent deposit by the 
inventors at Genset, S.A., 24 Rue Royale, 75008 Paris, France. 

Pools of cells containing the extended cDNAs (SEQ ID NOs: 40-140 and 242-377), 
from which cells containing a particular polynucleotide are obtainable, were 
deposited with the American Type Culture Collection, 10801 University Blvd., Manassas, 
VA 20110-2209 or the European Collection of Cell Cultures, Vaccine Research and 
Production Laboratory, Public Health Laboratory Service, Centre for Applied 
Microbiology and Research, Porton Down, Salisbury, Wiltshire SP4 0JG, United 
Kingdom. Each extended cDNA clone has been transfected into separate bacterial cells 
(E. coli) for this composite deposit. Table VI lists the deposit numbers of the clones of 
SEQ ID NOs: 40-140. One or more pools of cells containing the extended cDNAs of SEQ 
ID NOs: 242-377, from which the cells containing a particular polynucleotide are 
obtainable, will be deposited with the European Collection of Cell Cultures, Vaccine 
Research and Production Laboratory, Public Health Laboratory Service, Centre for 
Applied Microbiology and Research, Porton Down, Salisbury, Wiltshire SP4 0JG, 
United Kingdom and will be assigned ECACC deposit number XXXXXXX. Table VII 
provides the internal designation number assigned to each SEQ ID NO. and indicates 
whether the sequence is a nucleic acid sequence or a protein sequence. 

Each extended cDNA can be removed from the pED6dpc2 vector in which it was 
deposited by performing a NotI, PstI double digestion to produce the appropriate fragment 
for each clone. The proteins encoded by the extended cDNAs may also be expressed from 
the promoter in pED6dpc2. 

Bacterial cells containing a particular clone can be obtained from the composite 
deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The design of the 
oligonucleotide probe should preferably follow these parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 
(b) Preferably, the probe is designed to have a Tm of approx. 80°C (assuming 2 
degrees for each A or T and 4 degrees for each G or C). However, probes having 
melting temperatures between 40°C and 80°C may also be used provided that 
specificity is not lost. 
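
The melting temperature estimate of parameter (b) may be illustrated with the following minimal sketch. The function name is hypothetical, and the rejection of ambiguous bases is an assumption made so that only A, T, G, and C contribute to the estimate. The primer sequence of SEQ ID NO:38 is used here merely as a convenient input. 

```python
def estimated_tm(probe):
    """Estimate Tm in degrees C as 2 per A or T plus 4 per G or C."""
    probe = probe.upper()
    at_count = probe.count("A") + probe.count("T")
    gc_count = probe.count("G") + probe.count("C")
    if at_count + gc_count != len(probe):
        # Assumption: ambiguous bases such as "N" are not scored.
        raise ValueError("probe contains bases other than A, T, G, C")
    return 2 * at_count + 4 * gc_count


# SEQ ID NO:38 has 9 A/T and 10 G/C residues.
print(estimated_tm("GGCCATACACTTGAGTGAC"))  # 58
```

Under this estimate, a probe approaching 80°C would need to be longer or richer in G and C than the 19-mer shown. 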

The oligonucleotide should preferably be labeled with γ-[32P]ATP (specific 
activity 6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed 
techniques for labeling oligonucleotides. Other labeling techniques can also be used. 
Unincorporated label should preferably be removed by gel filtration chromatography or 
other established methods. The amount of radioactivity incorporated into the probe 
should be quantified by measurement in a scintillation counter. Preferably, specific 
activity of the resulting probe should be approximately 4X10^6 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably be 
thawed and 100 µl of the stock used to inoculate a sterile culture flask containing 25 ml of 
sterile L-broth containing ampicillin at 100 µg/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in fresh 
L-broth. Aliquots of these dilutions should preferably be plated to determine the dilution 
and volume which will yield approximately 5000 distinct and well-separated colonies on 
solid bacteriological media containing L-broth containing ampicillin at 100 µg/ml and 
agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other known 
methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 
colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 µg/ml of yeast RNA, and 10 mM EDTA 
(approximately 10 mL per 150 mm filter). Preferably, the probe is then added to the 
hybridization mix at a concentration greater than or equal to 1X10^6 dpm/mL. The filter is 
then preferably incubated at 65°C with gentle agitation overnight. The filter is then 
preferably washed in 500 mL of 2X SSC/0.1% SDS at room temperature with gentle 
shaking for 15 minutes. A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 
1 hour is optional. The filter is then preferably dried and subjected to autoradiography for 
sufficient time to visualize the positives on the X-ray film. Other known hybridization 
methods can also be employed. 
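
For reference, the molar composition implied by the 20X SSC stock recited above may be worked out as in the following sketch. It assumes that "Na citrate" denotes trisodium citrate dihydrate (about 294.10 g/mol), which is not stated in the text. The names used are hypothetical. 

```python
NACL_G_PER_MOL = 58.44
# Assumption: the citrate salt is trisodium citrate dihydrate.
NA3_CITRATE_DIHYDRATE_G_PER_MOL = 294.10

# Molar concentrations of the 20X stock (grams per liter / grams per mole).
STOCK_20X = {
    "NaCl": 175.3 / NACL_G_PER_MOL,
    "citrate": 88.2 / NA3_CITRATE_DIHYDRATE_G_PER_MOL,
}


def ssc_molarity(fold):
    """Molar NaCl and citrate in an SSC solution of the given strength."""
    scale = fold / 20.0
    return {name: molar * scale for name, molar in STOCK_20X.items()}


def sodium_molarity(fold):
    """Total Na+ molarity: one per NaCl plus three per trisodium citrate."""
    conc = ssc_molarity(fold)
    return conc["NaCl"] + 3 * conc["citrate"]


for fold in (20, 6, 2, 0.1):
    conc = ssc_molarity(fold)
    print(f"{fold}X SSC: {conc['NaCl']:.3f} M NaCl, "
          f"{conc['citrate']:.4f} M citrate, {sodium_molarity(fold):.3f} M Na+")
```

On these assumptions the 20X stock is approximately 3 M NaCl and 0.3 M citrate, so that 6X SSC is approximately 0.9 M NaCl and 0.09 M citrate. 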

The positive colonies are picked, grown in culture, and plasmid DNA isolated 
using standard procedures. The clones can then be verified by restriction analysis, 
hybridization analysis, or DNA sequencing. 

The plasmid DNA obtained using these procedures may then be manipulated using 
standard cloning techniques familiar to those skilled in the art. Alternatively, a PCR can 
be done with primers designed at both ends of the extended cDNA insertion. For 
example, a PCR reaction may be conducted using a primer having the sequence 
GGCCATACACTTGAGTGAC (SEQ ID NO:38) and a primer having the sequence 
ATATAGACAAACGCACACC (SEQ ID NO:39). The PCR product which 
corresponds to the extended cDNA can then be manipulated using standard cloning 
techniques familiar to those skilled in the art. 

In addition to PCR based methods for obtaining extended cDNAs, traditional 
hybridization based methods may also be employed. These methods may also be used to 
obtain the genomic DNAs which encode the mRNAs from which the 5' ESTs were 
derived, mRNAs corresponding to the extended cDNAs, or nucleic acids which are 
homologous to extended cDNAs or 5' ESTs. Example 29 below provides an example of 
such methods. 

EXAMPLE 29 

Methods for Obtaining Extended cDNAs or Nucleic 
Acids Homologous to Extended cDNAs or 5' ESTs 

A full length cDNA library can be made using the strategies described in 
Examples 13, 14, 15, and 16 above by replacing the random nonamer used in Example 14 
with an oligo-dT primer. For instance, the oligonucleotide of SEQ ID NO: 14 may be 
used. 

Alternatively, a cDNA library or genomic DNA library may be obtained from a 
commercial source or made using techniques familiar to those skilled in the art. The 
library includes cDNAs which are derived from the mRNA corresponding to a 5' EST or 
which have homology to an extended cDNA or 5' EST. The cDNA library or genomic 
DNA library is hybridized to a detectable probe comprising at least 10 consecutive 
nucleotides from the 5' EST or extended cDNA using conventional techniques. 
Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides from the 5' 
EST or extended cDNA. More preferably, the probe comprises at least 20-30 consecutive 
nucleotides from the 5' EST or extended cDNA. In some embodiments, the probe 
comprises at least 30 nucleotides from the 5' EST or extended cDNA. In other 
embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100, at least 
150, or at least 200 consecutive nucleotides from the 5' EST or extended cDNA. 

Techniques for identifying cDNA clones in a cDNA library which hybridize to a 
given probe sequence are disclosed in Sambrook et al., Molecular Cloning: A Laboratory 
Manual 2d Ed., Cold Spring Harbor Laboratory Press, 1989. The same techniques may be 
used to isolate genomic DNAs. 

Briefly, cDNA or genomic DNA clones which hybridize to the detectable probe 
are identified and isolated for further manipulation as follows. A probe comprising at 
least 10 consecutive nucleotides from the 5' EST or extended cDNA is labeled with a 
detectable label such as a radioisotope or a fluorescent molecule. Preferably, the probe 
comprises at least 12, 15, or 17 consecutive nucleotides from the 5' EST or extended 
cDNA. More preferably, the probe comprises 20-30 consecutive nucleotides from the 5' 
EST or extended cDNA. In some embodiments, the probe comprises more than 30 
nucleotides from the 5' EST or extended cDNA. In some embodiments, the probe 
comprises at least 40, at least 50, at least 75, at least 100, at least 150, or at least 200 
consecutive nucleotides from the 5' EST or extended cDNA. 

Techniques for labeling the probe are well known and include phosphorylation 
with polynucleotide kinase, nick translation, in vitro transcription, and non-radioactive 
techniques. The cDNAs or genomic DNAs in the library are transferred to a nitrocellulose 
or nylon filter and denatured. After incubation of the filter with a blocking solution, the 
filter is contacted with the labeled probe and incubated for a sufficient amount of time for 
the probe to hybridize to cDNAs or genomic DNAs containing a sequence capable of 
hybridizing to the probe. 

By varying the stringency of the hybridization conditions used to identify extended 
cDNAs or genomic DNAs which hybridize to the detectable probe, extended cDNAs 
having different levels of homology to the probe can be identified and isolated. To 
identify extended cDNAs or genomic DNAs having a high degree of homology to the 
probe sequence, the melting temperature of the probe may be calculated using the 
following formulas: 

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) 
is calculated using the formula: Tm=81.5+16.6(log [Na+])+0.41 (fraction G+C)-(600/N) 
where N is the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the melting 
temperature may be calculated using the equation Tm=81.5+16.6(log 
[Na+])+0.41 (fraction G+C)-(0.63% formamide)-(600/N) where N is the length of the 
probe. 
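
The two formulas above may be evaluated as in the following sketch, offered for illustration only. It assumes that the logarithm is base 10, that [Na+] is expressed in moles per liter, and that the G+C term is entered as a percentage (as in the customary form of this equation) even though the text reads "fraction G+C". The function name is hypothetical. 

```python
import math


def probe_tm(probe, na_molar, formamide_percent=0.0):
    """Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%G+C) - 0.63 (%formamide) - 600/N.

    With formamide_percent left at 0 this reduces to the first formula.
    """
    probe = probe.upper()
    n = len(probe)
    if not 14 <= n <= 70:
        # The text gives the formula for probes of 14 to 70 nucleotides.
        raise ValueError("formula is stated for probes of 14 to 70 nucleotides")
    gc_percent = 100.0 * (probe.count("G") + probe.count("C")) / n
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / n)


probe = "ATGC" * 5  # hypothetical 20-mer with 50% G+C
print(round(probe_tm(probe, na_molar=1.0), 1))                        # 72.0
print(round(probe_tm(probe, na_molar=1.0, formamide_percent=50), 1))  # 40.5
```

In this example, 50% formamide lowers the calculated Tm by 31.5°C. 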

Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% 
SDS, 100 µg denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's 
reagent, 0.5% SDS, 100 µg denatured fragmented salmon sperm DNA, 50% formamide. 
The formulas for SSC and Denhardt's solutions are listed in Sambrook et al., supra. 

Hybridization is conducted by adding the detectable probe to the prehybridization 
solutions listed above. Where the probe comprises double stranded DNA, it is denatured 
before addition to the hybridization solution. The filter is contacted with the hybridization 
solution for a sufficient period of time to allow the probe to hybridize to extended cDNAs 
or genomic DNAs containing sequences complementary thereto or homologous thereto. 
For probes over 200 nucleotides in length, the hybridization may be carried out at 15-25°C 
below the Tm. For shorter probes, such as oligonucleotide probes, the hybridization may 
be conducted at 15-25°C below the Tm. Preferably, for hybridizations in 6X SSC, the 
hybridization is conducted at approximately 68°C. Preferably, for hybridizations in 50% 
formamide containing solutions, the hybridization is conducted at approximately 42°C. 

All of the foregoing hybridizations would be considered to be under "stringent" 
conditions. Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room 
temperature for 15 minutes. The filter is then washed with 0.1X SSC, 0.5% SDS at room 
temperature for 30 minutes to 1 hour. Thereafter, the filter is washed at the 
hybridization temperature in 0.1X SSC, 0.5% SDS. A final wash is conducted in 0.1X 
SSC at room temperature. 

Extended cDNAs, nucleic acids homologous to extended cDNAs or 5' ESTs, or 
genomic DNAs which have hybridized to the probe are identified by autoradiography or 
other conventional techniques. 

The above procedure may be modified to identify extended cDNAs, nucleic acids 
homologous to extended cDNAs, or genomic DNAs having decreasing levels of 
homology to the probe sequence. For example, to obtain extended cDNAs, nucleic acids 
homologous to extended cDNAs, or genomic DNAs of decreasing homology to the 
detectable probe, less stringent conditions may be used. For example, the hybridization 
temperature may be decreased in increments of 5°C from 68°C to 42°C in a hybridization 
buffer having a Na+ concentration of approximately 1M. Following hybridization, the 
filter may be washed with 2X SSC, 0.5% SDS at the temperature of hybridization. These 
conditions are considered to be "moderate" conditions above 50°C and "low" conditions 
below 50°C. 

Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, 
containing formamide at a temperature of 42°C. In this case, the concentration of 
formamide in the hybridization buffer may be reduced in 5% increments from 50% to 0% 
to identify clones having decreasing levels of homology to the probe. Following 
hybridization, the filter may be washed with 6X SSC, 0.5% SDS at 50°C. These 
conditions are considered to be "moderate" conditions above 25% formamide and "low" 
conditions below 25% formamide. 
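
The two reduced stringency series described above may be enumerated as in the following sketch. The function names are hypothetical. Because the text does not say how a condition exactly at 50°C or 25% formamide is to be classified, the sketch labels such a condition "unspecified", which is an assumption. 

```python
def descending_series(start, stop, step):
    """Values from start down toward stop in fixed decrements."""
    values = []
    value = start
    while value >= stop:
        values.append(value)
        value -= step
    return values


def classify(value, cutoff):
    """'moderate' above the cutoff, 'low' below it."""
    if value > cutoff:
        return "moderate"
    if value < cutoff:
        return "low"
    return "unspecified"  # assumption: the text is silent on the cutoff itself


# Temperature series in approximately 1M Na+: cutoff at 50 degrees C.
for temp_c in descending_series(68, 42, 5):
    print(f"{temp_c} C: {classify(temp_c, 50)}")

# Formamide series at 42 degrees C: cutoff at 25% formamide.
for percent in descending_series(50, 0, 5):
    print(f"{percent}% formamide: {classify(percent, 25)}")
```

Decrements of exactly 5°C starting from 68°C end at 43°C rather than 42°C, so the final step of the temperature series would be slightly smaller in practice. 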

Extended cDNAs, nucleic acids homologous to extended cDNAs, or genomic 
DNAs which have hybridized to the probe are identified by autoradiography. 

If it is desired to obtain nucleic acids homologous to extended cDNAs, such as 
allelic variants thereof or nucleic acids encoding proteins related to the proteins encoded 
by the extended cDNAs, the level of homology between the hybridized nucleic acid and 
the extended cDNA or 5' EST used as the probe may readily be determined. To determine 
the level of homology between the hybridized nucleic acid and the extended cDNA or 
5'EST from which the probe was derived, the nucleotide sequences of the hybridized 
nucleic acid and the extended cDNA or 5'EST from which the probe was derived are 
compared. For example, using the above methods, nucleic acids having at least 95% 
nucleic acid homology to the extended cDNA or 5'EST from which the probe was derived 
may be obtained and identified. Similarly, by using progressively less stringent 
hybridization conditions one can obtain and identify nucleic acids having at least 90%, at 
least 85%, at least 80% or at least 75% homology to the extended cDNA or 5'EST from 
which the probe was derived. The level of homology between the hybridized nucleic acid 
and the extended cDNA or 5' EST used as the probe may be further determined using 
BLAST2N; parameters may be adapted depending on the sequence length and degree of 
homology studied. In such comparisons, the default parameters or the parameters listed in 
Tables II and III may be used. 

To determine whether a clone encodes a protein having a given amount of 
homology to the protein encoded by the extended cDNA or 5' EST, the amino acid 
sequence encoded by the extended cDNA or 5' EST is compared to the amino acid 
sequence encoded by the hybridizing nucleic acid. Homology is determined to exist when 
an amino acid sequence in the extended cDNA or 5' EST is closely related to an amino 
acid sequence in the hybridizing nucleic acid. A sequence is closely related when it is 
identical to that of the extended cDNA or 5' EST or when it contains one or more amino 
acid substitutions therein in which amino acids having similar characteristics have been 
substituted for one another. Using the above methods, one can obtain nucleic acids 
encoding proteins having at least 95%, at least 90%, at least 85%, at least 80% or at least 
75% homology to the proteins encoded by the extended cDNA or 5'EST from which the 
probe was derived. Using the above methods and algorithms such as FASTA with 
parameters depending on the sequence length and degree of homology studied the level of 
homology may be determined. In determining the level of homology using FASTA, the 
default parameters or the parameters listed in Tables II or III may be used. 
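
The notion of closely related amino acid sequences described above may be illustrated by the following sketch, which scores two sequences that have already been aligned to equal length. It is not a substitute for FASTA. The grouping of amino acids having similar characteristics is a hypothetical example chosen for illustration, and the function name is likewise hypothetical. 

```python
# Assumption: an illustrative grouping of residues with similar characteristics.
SIMILAR_GROUPS = ["AVLIM", "FWY", "STNQ", "KRH", "DE", "C", "G", "P"]
GROUP_OF = {aa: index for index, group in enumerate(SIMILAR_GROUPS) for aa in group}


def percent_identity_and_similarity(seq_a, seq_b):
    """Return (% identical positions, % identical-or-similar positions)."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    identical = 0
    similar = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b:
            identical += 1
            similar += 1
        elif a in GROUP_OF and GROUP_OF[a] == GROUP_OF.get(b):
            # Conservative substitution within the same group.
            similar += 1
    length = len(seq_a)
    return 100.0 * identical / length, 100.0 * similar / length


print(percent_identity_and_similarity("AK", "VR"))  # (0.0, 100.0)
print(percent_identity_and_similarity("AD", "AK"))  # (50.0, 50.0)
```

The second value of the pair counts conservative substitutions as matches, which corresponds to the looser sense of homology used in this paragraph. 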

Alternatively, extended cDNAs may be prepared by obtaining mRNA from the 
tissue, cell, or organism of interest using mRNA preparation procedures utilizing poly A 
selection procedures or other techniques known to those skilled in the art. A first primer 
capable of hybridizing to the poly A tail of the mRNA is hybridized to the mRNA and a 
reverse transcription reaction is performed to generate a first cDNA strand. 

The first cDNA strand is hybridized to a second primer containing at least 10 
consecutive nucleotides of the sequences of the 5' EST for which an extended cDNA is 
desired. Preferably, the primer comprises at least 12, 15, or 17 consecutive nucleotides 
from the sequences of the 5' EST. More preferably, the primer comprises 20-30 
consecutive nucleotides from the sequences of the 5' EST. In some embodiments, the 
primer comprises more than 30 nucleotides from the sequences of the 5' EST. If it is 
desired to obtain extended cDNAs containing the full protein coding sequence, including 
the authentic translation initiation site, the second primer used contains sequences located 
upstream of the translation initiation site. The second primer is extended to generate a 
second cDNA strand complementary to the first cDNA strand. Alternatively, RT-PCR 
may be performed as described above using primers from both ends of the cDNA to be 
obtained. 

Extended cDNAs containing 5' fragments of the mRNA may be prepared by 
contacting an mRNA comprising the sequence of the 5' EST for which an extended cDNA 
is desired with a primer comprising at least 10 consecutive nucleotides of the sequences 
complementary to the 5' EST, hybridizing the primer to the mRNAs, and reverse 
transcribing the hybridized primer to make a first cDNA strand from the mRNAs. 
Preferably, the primer comprises at least 12, 15, or 17 consecutive nucleotides from the 5' 
EST. More preferably, the primer comprises 20-30 consecutive nucleotides from the 5' 
EST. 

Thereafter, a second cDNA strand complementary to the first cDNA strand is 
synthesized. The second cDNA strand may be made by hybridizing a primer 
complementary to sequences in the first cDNA strand to the first cDNA strand and 
extending the primer to generate the second cDNA strand. 

The double stranded extended cDNAs made using the methods described above 
are isolated and cloned. The extended cDNAs may be cloned into vectors such as 
plasmids or viral vectors capable of replicating in an appropriate host cell. For example, 
the host cell may be a bacterial, mammalian, avian, or insect cell. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to 
mRNA to generate a first cDNA strand, extending a primer to make a second cDNA 
strand complementary to the first cDNA strand, isolating the double stranded cDNA and 
cloning the double stranded cDNA are well known to those skilled in the art and are 
described in Current Protocols in Molecular Biology, John Wiley & Sons, Inc. 1997 and 
Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor Laboratory Press, 1989. 

Alternatively, kits for obtaining full length cDNAs, such as the GeneTrapper (Cat. 
No. 10356-020, Gibco, BRL), may be used for obtaining full length cDNAs or extended 
cDNAs. In this approach, full length or extended cDNAs are prepared from mRNA and 
cloned into double stranded phagemids. The cDNA library in the double stranded 
phagemids is then rendered single stranded by treatment with an endonuclease, such as the 
Gene II product of the phage F1, and Exonuclease III as described in the manual 
accompanying the GeneTrapper kit. A biotinylated oligonucleotide comprising the 
sequence of a 5' EST, or a fragment containing at least 10 nucleotides thereof, is 
hybridized to the single stranded phagemids. Preferably, the fragment comprises at least 
12, 15, or 17 consecutive nucleotides from the 5' EST. More preferably, the fragment 
comprises 20-30 consecutive nucleotides from the 5' EST. In some procedures, the 
fragment may comprise more than 30 consecutive nucleotides from the 5' EST. For 
example, the fragment may comprise at least 40, at least 50, at least 75, at least 100, at 
least 150, or at least 200 consecutive nucleotides from the 5' EST. 

Hybrids between the biotinylated oligonucleotide and phagemids having inserts 
containing the 5' EST sequence are isolated by incubating the hybrids with streptavidin 
coated paramagnetic beads and retrieving the beads with a magnet. Thereafter, the 
resulting phagemids containing the 5' EST sequence are released from the beads and 
converted into double stranded DNA using a primer specific for the 5' EST sequence. The 
resulting double stranded DNA is transformed into bacteria. Extended cDNAs containing 
the 5' EST sequence are identified by colony PCR or colony hybridization. 

A plurality of extended cDNAs containing full length protein coding sequences or 
sequences encoding only the mature protein remaining after the signal peptide is cleaved 
may be provided as cDNA libraries for subsequent evaluation of the encoded proteins or 
use in diagnostic assays as described below. 

IV. Expression of Proteins Encoded by Extended cDNAs Isolated Using 5' ESTs 

Extended cDNAs containing the full protein coding sequences of their 
corresponding mRNAs or portions thereof, such as cDNAs encoding the mature protein, 
may be used to express the secreted proteins or portions thereof which they encode as 
described in Example 30 below. If desired, the extended cDNAs may contain the 
sequences encoding the signal peptide to facilitate secretion of the expressed protein. It 
will be appreciated that a plurality of extended cDNAs containing the full protein coding 
sequences or portions thereof may be simultaneously cloned into expression vectors to 
create an expression library for analysis of the encoded proteins as described below. 

EXAMPLE 30 

Expression of the Proteins Encoded by Extended cDNAs or Portions Thereof 

To express the proteins encoded by the extended cDNAs or portions thereof, 
nucleic acids containing the coding sequence for the proteins or portions thereof to be 
expressed are obtained as described in Examples 27-29 and cloned into a suitable 
expression vector. If desired, the nucleic acids may contain the sequences encoding the 
signal peptide to facilitate secretion of the expressed protein. For example, the nucleic 
acid may comprise the sequence of one of SEQ ID NOs: 40-140 and 242-377 listed in 
Table IV and in the accompanying sequence listing. Alternatively, the nucleic acid may 
comprise those nucleotides which make up the full coding sequence of one of the 
sequences of SEQ ID NOs: 40-140 and 242-377 as defined in Table IV above. 

It will be appreciated that should the extent of the full coding sequence (i.e. the 
sequence encoding the signal peptide and the mature protein resulting from cleavage of 
the signal peptide) differ from that listed in Table IV as a result of a sequencing error, 
reverse transcription or amplification error, mRNA splicing, post-translational 
modification of the encoded protein, enzymatic cleavage of the encoded protein, or other 
biological factors, one skilled in the art would be readily able to identify the extent of the 
full coding sequences in the sequences of SEQ ID NOs. 40-140 and 242-377. For 
example, the sequence of SEQ ID NO: 115 represents an alternatively spliced transcript of 
a previously identified mRNA. Accordingly, the scope of any claims herein relating to 
nucleic acids containing the full coding sequence of one of SEQ ID NOs. 40-140 and 242- 
377 is not to be construed as excluding any readily identifiable variations from or 
equivalents to the full coding sequences listed in Table IV. Similarly, should the extent of 
the full length polypeptides differ from those indicated in Table V as a result of any of the 
preceding factors, the scope of claims relating to polypeptides comprising the amino acid 
sequence of the full length polypeptides is not to be construed as excluding any readily 
identifiable variations from or equivalents to the sequences listed in Table V. 

Alternatively, the nucleic acid used to express the protein or portion thereof may 
comprise those nucleotides which encode the mature protein (i.e. the protein created by 
cleaving the signal peptide off) encoded by one of the sequences of SEQ ID NOs: 40-140 
and 242-377 as defined in Table IV above. 

It will be appreciated that should the extent of the sequence encoding the mature 
protein differ from that listed in Table IV as a result of a sequencing error, reverse 
transcription or amplification error, mRNA splicing, post-translational modification of the 
encoded protein, enzymatic cleavage of the encoded protein, or other biological factors, 
one skilled in the art would be readily able to identify the extent of the sequence encoding 
the mature protein in the sequences of SEQ ID NOs. 40-140 and 242-377. Accordingly, 
the scope of any claims herein relating to nucleic acids containing the sequence encoding 
the mature protein encoded by one of SEQ ID Nos. 40-140 and 242-377 is not to be 
construed as excluding any readily identifiable variations from or equivalents to the 
sequences listed in Table IV. Thus, claims relating to nucleic acids containing the 
sequence encoding the mature protein encompass equivalents to the sequences listed in 
Table IV, such as sequences encoding biologically active proteins resulting from post- 
translational modification, enzymatic cleavage, or other readily identifiable variations 
from or equivalents to the secreted proteins in addition to cleavage of the signal peptide. 
Similarly, should the extent of the mature polypeptides differ from those indicated in 
Table V as a result of any of the preceding factors, the scope of claims relating to 
polypeptides comprising the sequence of a mature protein included in the sequence of one 
of SEQ ID NOs. 141-241 and 378-513 is not to be construed as excluding any readily 
identifiable variations from or equivalents to the sequences listed in Table V. Thus, 
claims relating to polypeptides comprising the sequence of the mature protein encompass 
equivalents to the sequences listed in Table IV, such as biologically active proteins 
resulting from post-translational modification, enzymatic cleavage, or other readily 
identifiable variations from or equivalents to the secreted proteins in addition to cleavage 
of the signal peptide. It will also be appreciated that should the biologically active form of 
the polypeptides included in the sequence of one of SEQ ID NOs. 141-241 and 378-513 or 
the nucleic acids encoding the biologically active form of the polypeptides differ from 
those identified as the mature polypeptide in Table V or the nucleotides encoding the 
mature polypeptide in Table IV as a result of a sequencing error, reverse transcription or 
amplification error, mRNA splicing, post-translational modification of the encoded 
protein, enzymatic cleavage of the encoded protein, or other biological factors, one skilled 
in the art would be readily able to identify the amino acids in the biologically active form 
of the polypeptides and the nucleic acids encoding the biologically active form of the 
polypeptides. In such instances, the claims relating to polypeptides comprising the mature
protein included in one of SEQ ID NOs. 141-241 and 378-513 or nucleic acids comprising 
the nucleotides of one of SEQ ID NOs. 40-140 and 242-377 encoding the mature protein 
shall not be construed to exclude any readily identifiable variations from the sequences
listed in Table IV and Table V. 

In some embodiments, the nucleic acid used to express the protein or portion
thereof may comprise those nucleotides which encode the signal peptide encoded by one
of the sequences of SEQ ID NOs: 40-140 and 242-377 as defined in Table IV above.

It will be appreciated that should the extent of the sequence encoding the signal
peptide differ from that listed in Table IV as a result of a sequencing error, reverse 
transcription or amplification error, mRNA splicing, post-translational modification of the 
encoded protein, enzymatic cleavage of the encoded protein, or other biological factors, 
one skilled in the art would be readily able to identify the extent of the sequence encoding 
the signal peptide in the sequences of SEQ ID NOs. 40-140 and 242-377. Accordingly, 
the scope of any claims herein relating to nucleic acids containing the sequence encoding 
the signal peptide encoded by one of SEQ ID Nos. 40-140 and 242-377 is not to be 
construed as excluding any readily identifiable variations from the sequences listed in 
Table IV. Similarly, should the extent of the signal peptides differ from those indicated in 
Table V as a result of any of the preceding factors, the scope of claims relating to 
polypeptides comprising the sequence of a signal peptide included in the sequence of one 
of SEQ ID NOs. 141-241 and 378-513 is not to be construed as excluding any readily 
identifiable variations from the sequences listed in Table V. 

Alternatively, the nucleic acid may encode a polypeptide comprising at least 10 
consecutive amino acids of one of the sequences of SEQ ID NOs: 141-241 and 378-513. 
In some embodiments, the nucleic acid may encode a polypeptide comprising at least 15 
consecutive amino acids of one of the sequences of SEQ ID NOs: 141-241 and 378-513. 
In other embodiments, the nucleic acid may encode a polypeptide comprising at least 25 
consecutive amino acids of one of the sequences of SEQ ID NOs: 141-241 and 378-513. 
In other embodiments, the nucleic acid may encode a polypeptide comprising at least 60, 
at least 75, at least 100 or more than 100 consecutive amino acids of one of the sequences 
of SEQ ID NOs: 141-241 and 378-513.
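
The "at least N consecutive amino acids" criterion above can be illustrated with a minimal Python sketch. The function names and the example sequences are hypothetical and are not taken from the sequence listing; the sketch only shows how one might enumerate qualifying fragments or test whether a candidate polypeptide shares such a run with a reference sequence.

```python
def consecutive_fragments(polypeptide, min_length=10):
    """Yield every run of at least `min_length` consecutive residues."""
    n = len(polypeptide)
    for length in range(min_length, n + 1):
        for start in range(n - length + 1):
            yield polypeptide[start:start + length]


def shares_fragment(candidate, reference, min_length=10):
    """Return True if `candidate` contains at least `min_length`
    consecutive residues of `reference`.

    Any shared run longer than `min_length` necessarily contains a shared
    run of exactly `min_length`, so only windows of that size are compared.
    """
    windows = {
        reference[i:i + min_length]
        for i in range(len(reference) - min_length + 1)
    }
    return any(
        candidate[i:i + min_length] in windows
        for i in range(len(candidate) - min_length + 1)
    )


if __name__ == "__main__":
    reference = "MKTLLVAGLLCLSAAQAEEPK"   # hypothetical polypeptide
    candidate = "XXTLLVAGLLCLSXX"         # hypothetical test sequence
    print(shares_fragment(candidate, reference, min_length=10))
```

The same check can be repeated with `min_length` set to 15, 25, 60, 75 or 100 to mirror the successive embodiments.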

The nucleic acids inserted into the expression vectors may also contain sequences 
upstream of the sequences encoding the signal peptide, such as sequences which regulate 
expression levels or sequences which confer tissue specific expression. 

The nucleic acid encoding the protein or polypeptide to be expressed is operably 
linked to a promoter in an expression vector using conventional cloning technology. The 
expression vector may be any of the mammalian, yeast, insect or bacterial expression
systems known in the art. Commercially available vectors and expression systems are
available from a variety of suppliers including Genetics Institute (Cambridge, MA),
Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and Invitrogen (San
Diego, California). If desired, to enhance expression and facilitate proper protein folding,
the codon context and codon pairing of the sequence may be optimized for the particular
expression organism in which the expression vector is introduced, as explained by
Hatfield, et al., U.S. Patent No. 5,082,767, incorporated herein by this reference.

The following is provided as one exemplary method to express the proteins 
encoded by the extended cDNAs corresponding to the 5' ESTs or the nucleic acids 
described above. First, the methionine initiation codon for the gene and the poly A signal 
of the gene are identified. If the nucleic acid encoding the polypeptide to be expressed 
lacks a methionine to serve as the initiation site, an initiating methionine can be introduced 
next to the first codon of the nucleic acid using conventional techniques. Similarly, if the 
extended cDNA lacks a poly A signal, this sequence can be added to the construct by, for 
example, splicing out the poly A signal from pSG5 (Stratagene) using BglI and SalI
restriction endonuclease enzymes and incorporating it into the mammalian expression
vector pXT1 (Stratagene). pXT1 contains the LTRs and a portion of the gag gene from
Moloney Murine Leukemia Virus. The position of the LTRs in the construct allows
efficient stable transfection. The vector includes the Herpes Simplex Thymidine Kinase 
promoter and the selectable neomycin gene. The extended cDNA or portion thereof 
encoding the polypeptide to be expressed is obtained by PCR from the bacterial vector 
using oligonucleotide primers complementary to the extended cDNA or portion thereof 
and containing restriction endonuclease sequences for PstI incorporated into the 5' primer
and BglII at the 5' end of the corresponding cDNA 3' primer, taking care to ensure that the
extended cDNA is positioned in frame with the poly A signal. The purified fragment
obtained from the resulting PCR reaction is digested with PstI, blunt ended with an
exonuclease, digested with BglII, purified and ligated to pXT1, now containing a poly A
signal and digested with BglII.

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life
Technologies, Inc., Grand Island, New York) under conditions outlined in the product 
specification. Positive transfectants are selected after growing the transfected cells in 
600 µg/ml G418 (Sigma, St. Louis, Missouri). Preferably the expressed protein is released
into the culture medium, thereby facilitating purification.
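
The primer construction step of the exemplary method above (a PstI site on the 5' primer, a BglII site at the 5' end of the 3' primer, and an initiating methionine added where one is lacking) can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the annealing length, and the example sequence are hypothetical, and the extra flanking bases normally added outside a restriction site for efficient digestion are omitted.

```python
PSTI_SITE = "CTGCAG"    # PstI recognition sequence
BGLII_SITE = "AGATCT"   # BglII recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(coding_seq, anneal_len=20):
    """Build (5' primer, 3' primer) carrying PstI and BglII sites.

    `anneal_len` (an assumed value) is the number of template bases each
    primer anneals to. An ATG is prepended if the coding sequence lacks an
    initiating methionine codon.
    """
    coding = coding_seq.upper()
    if not coding.startswith("ATG"):
        coding = "ATG" + coding          # introduce an initiating methionine
    if len(coding) % 3 != 0:
        raise ValueError("coding sequence is not a whole number of codons")
    forward = PSTI_SITE + coding[:anneal_len]
    reverse = BGLII_SITE + reverse_complement(coding[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical coding sequence, not one of the SEQ ID NOs.
    fwd, rev = design_primers("ATGGCTAAAGGTCTGTAA", anneal_len=6)
    print(fwd)
    print(rev)
```

The codon-length check is only a coarse sanity test; whether the insert ends up in frame with downstream elements still depends on the vector junctions.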

Alternatively, the extended cDNAs may be cloned into pED6dpc2 as described 
above. The resulting pED6dpc2 constructs may be transfected into a suitable host cell, 
such as COS 1 cells. Methotrexate resistant cells are selected and expanded. Preferably, 
the protein expressed from the extended cDNA is released into the culture medium 
thereby facilitating purification.

Proteins in the culture medium are separated by gel electrophoresis. If desired, the
proteins may be ammonium sulfate precipitated or separated based on size or charge prior 
to electrophoresis. 

As a control, the expression vector lacking a cDNA insert is introduced into host 
cells or organisms and the proteins in the medium are harvested. The secreted proteins
present in the medium are detected using techniques such as Coomassie or silver staining
or using antibodies against the protein encoded by the extended cDNA. Coomassie and
silver staining techniques are familiar to those skilled in the art.

Antibodies capable of specifically recognizing the protein of interest may be 
generated using synthetic 15-mer peptides having a sequence encoded by the appropriate
5' EST, extended cDNA, or portion thereof. The synthetic peptides are injected into mice
to generate antibody to the polypeptide encoded by the 5' EST, extended cDNA, or
portion thereof.

Secreted proteins from the host cells or organisms containing an expression vector 
which contains the extended cDNA derived from a 5' EST or a portion thereof are
compared to those from the control cells or organism. The presence of a band in the 
medium from the cells containing the expression vector which is absent in the medium 
from the control cells indicates that the extended cDNA encodes a secreted protein. 
Generally, the band corresponding to the protein encoded by the extended cDNA will
have a mobility near that expected based on the number of amino acids in the open
reading frame of the extended cDNA. However, the band may have a mobility different 
than that expected as a result of modifications such as glycosylation, ubiquitination, or 
enzymatic cleavage. 
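
As a rough illustration of the expected mobility referred to above, the following Python sketch estimates a polypeptide's mass from its residue count. It assumes the common rule of thumb of about 110 Da per amino acid residue; the function names, the tolerance, and the example values are hypothetical and are not part of the described method.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, an assumption


def expected_mass_kda(num_residues, signal_peptide_residues=0):
    """Estimate the mass (kDa) of a polypeptide from its residue count,
    optionally subtracting a cleaved signal peptide."""
    mature = num_residues - signal_peptide_residues
    if mature <= 0:
        raise ValueError("mature polypeptide must have at least one residue")
    return mature * AVERAGE_RESIDUE_MASS_DA / 1000.0


def is_near_expected(observed_kda, expected_kda, tolerance=0.15):
    """Return True if the observed band lies within a fractional tolerance
    (assumed here to be 15%) of the expected mass."""
    return abs(observed_kda - expected_kda) / expected_kda <= tolerance


if __name__ == "__main__":
    expected = expected_mass_kda(200, signal_peptide_residues=20)
    print(expected, is_near_expected(21.0, expected))
```

A band falling well outside the tolerance would not rule out the protein, since glycosylation, ubiquitination, or enzymatic cleavage can shift mobility as noted above.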

Alternatively, if the protein expressed from the above expression vectors does not
contain sequences directing its secretion, the proteins expressed from host cells containing
an expression vector containing an insert encoding a secreted protein or portion thereof
can be compared to the proteins expressed in host cells containing the expression vector
without an insert. The presence of a band in samples from cells containing the expression
vector with an insert which is absent in samples from cells containing the expression
vector without an insert indicates that the desired protein or portion thereof is being
expressed. Generally, the band will have the mobility expected for the secreted protein or 
portion thereof. However, the band may have a mobility different than that expected as a 
result of modifications such as glycosylation, ubiquitination, or enzymatic cleavage. 

The protein encoded by the extended cDNA may be purified using standard
immunochromatography techniques. In such procedures, a solution containing the
secreted protein, such as the culture medium or a cell extract, is applied to a column
having antibodies against the secreted protein attached to the chromatography matrix. The
secreted protein is allowed to bind the immunochromatography column. Thereafter, the
column is washed to remove non-specifically bound proteins. The specifically bound
secreted protein is then released from the column and recovered using standard 
techniques. 

If antibody production is not possible, the extended cDNA sequence or portion 
thereof may be incorporated into expression vectors designed for use in purification 
schemes employing chimeric polypeptides. In such strategies the coding sequence of the
extended cDNA or portion thereof is inserted in frame with the gene encoding the other
half of the chimera. The other half of the chimera may be β-globin or a nickel binding
polypeptide encoding sequence. A chromatography matrix having antibody to β-globin or
nickel attached thereto is then used to purify the chimeric protein. Protease cleavage sites
may be engineered between the β-globin gene or the nickel binding polypeptide and the
extended cDNA or portion thereof. Thus, the two polypeptides of the chimera may be 
separated from one another by protease digestion. 

One useful expression vector for generating β-globin chimerics is pSG5
(Stratagene), which encodes rabbit β-globin. Intron II of the rabbit β-globin gene
facilitates splicing of the expressed transcript, and the polyadenylation signal incorporated 
into the construct increases the level of expression. These techniques as described are well 
known to those skilled in the art of molecular biology. Standard methods are published in 
methods texts such as Davis et al., (Basic Methods in Molecular Biology, L.G. Davis, 
M.D. Dibner, and J.F. Battey, ed., Elsevier Press, NY, 1986) and many of the methods are 
available from Stratagene, Life Technologies, Inc., or Promega. Polypeptide may 
additionally be produced from the construct using in vitro translation systems such as the 
In vitro Express™ Translation Kit (Stratagene). 

Following expression and purification of the secreted proteins encoded by the 5' 
ESTs, extended cDNAs, or fragments thereof, the purified proteins may be tested for the 
ability to bind to the surface of various cell types as described in Example 31 below. It 
will be appreciated that a plurality of proteins expressed from these cDNAs may be 
included in a panel of proteins to be simultaneously evaluated for the activities specifically 
described below, as well as other biological roles for which assays for determining activity 
are available. 

EXAMPLE 31 

Analysis of Secreted Proteins to Determine Whether They Bind to the Cell Surface
The proteins encoded by the 5' ESTs, extended cDNAs, or fragments thereof are 
cloned into expression vectors such as those described in Example 30. The proteins are 
purified by size, charge, immunochromatography or other techniques familiar to those 
skilled in the art. Following purification, the proteins are labeled using techniques known 
to those skilled in the art. The labeled proteins are incubated with cells or cell lines 
derived from a variety of organs or tissues to allow the proteins to bind to any receptor 
present on the cell surface. Following the incubation, the cells are washed to remove
non-specifically bound protein. The labeled proteins are detected by autoradiography.
Alternatively, unlabeled proteins may be incubated with the cells and detected with 
antibodies having a detectable label, such as a fluorescent molecule, attached thereto. 

Specificity of cell surface binding may be analyzed by conducting a competition 
analysis in which various amounts of unlabeled protein are incubated along with the 
labeled protein. The amount of labeled protein bound to the cell surface decreases as the 
amount of competitive unlabeled protein increases. As a control, various amounts of an 
unlabeled protein unrelated to the labeled protein are included in some binding reactions.
The amount of labeled protein bound to the cell surface does not decrease in binding
reactions containing increasing amounts of unrelated unlabeled protein, indicating that the
protein encoded by the cDNA binds specifically to the cell surface. 
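
The logic of the competition analysis above can be sketched in Python. This is a minimal illustration, not part of the described procedure: the function names, the displacement thresholds, and the signal values are hypothetical. Each list holds the bound labeled-protein signal measured at increasing amounts of unlabeled competitor, with the first entry taken as the baseline.

```python
def fraction_displaced(bound_signal):
    """Fraction of labeled protein displaced at the highest competitor
    amount, relative to the first (baseline) measurement."""
    baseline = bound_signal[0]
    if baseline <= 0:
        raise ValueError("baseline signal must be positive")
    return 1.0 - bound_signal[-1] / baseline


def binding_is_specific(with_same_protein, with_unrelated_protein,
                        min_displacement=0.5, max_unrelated=0.1):
    """Return True if the unlabeled form of the same protein displaces the
    label while an unrelated unlabeled protein does not.

    Both thresholds are illustrative assumptions, not values from the text.
    """
    return (fraction_displaced(with_same_protein) >= min_displacement
            and fraction_displaced(with_unrelated_protein) <= max_unrelated)


if __name__ == "__main__":
    same = [1000, 700, 400, 150]        # hypothetical counts
    unrelated = [1000, 980, 1010, 990]  # hypothetical counts
    print(binding_is_specific(same, unrelated))
```

In practice the thresholds would be chosen from replicate measurements rather than fixed in advance.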

As discussed above, secreted proteins have been shown to have a number of 
important physiological effects and, consequently, represent a valuable therapeutic 
resource. The secreted proteins encoded by the extended cDNAs or portions thereof made 
according to Examples 27-29 may be evaluated to determine their physiological activities 
as described below. 

EXAMPLE 32 

Assaying the Proteins Expressed from Extended cDNAs or Portions Thereof for Cytokine,
Cell Proliferation or Cell Differentiation Activity
As discussed above, secreted proteins may act as cytokines or may affect cellular 
proliferation or differentiation. Many protein factors discovered to date, including all 
known cytokines, have exhibited activity in one or more factor dependent cell 
proliferation assays, and hence the assays serve as a convenient confirmation of cytokine 
activity. The activity of a protein of the present invention is evidenced by any one of a 
number of routine factor dependent cell proliferation assays for cell lines including, 
without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+ (preB M+),
2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. The proteins
encoded by the above extended cDNAs or portions thereof may be evaluated for their 
ability to regulate T cell or thymocyte proliferation in assays such as those described
above or in the following references, which are incorporated herein by reference:
Current Protocols in Immunology, Ed. by J.E. Coligan et al., Greene Publishing
Associates and Wiley-Interscience; Takai et al., J. Immunol. 137:3494-3500, 1986;
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular
Immunology 133:327-341, 1991; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992;
Bowman et al., J. Immunol. 152:1756-1761, 1994.

In addition, numerous assays for cytokine production and/or the proliferation of 
spleen cells, lymph node cells and thymocytes are known. These include the techniques 
disclosed in Current Protocols in Immunology. J.E. Coligan et al. Eds., Vol 1 pp. 
3.12.1-3.12.14 John Wiley and Sons, Toronto. 1994; and Schreiber, R.D. Current 
Protocols in Immunology., supra Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 
1994. 

The proteins encoded by the cDNAs may also be assayed for the ability to regulate 
the proliferation and differentiation of hematopoietic or lymphopoietic cells. Many assays 
for such activity are familiar to those skilled in the art, including the assays in the 
following references, which are incorporated herein by reference: Bottomly, K., Davis, 
L.S. and Lipsky, P.E., Measurement of Human and Murine Interleukin 2 and Interleukin 
4, Current Protocols in Immunology., J.E. Coligan et al. Eds. Vol 1 pp. 6.3.1-6.3.12, 
John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991;
Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci.
U.S.A. 80:2931-2938, 1983; Nordan, R., Measurement of Mouse and Human Interleukin 
6 Current Protocols in Immunology. J.E. Coligan et al. Eds. Vol 1 pp. 6.6.1-6.6.5, John 
Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857- 
1861, 1986; Bennett, F., Giannotti, J., Clark, S.C. and Turner, K.J., Measurement of 
Human Interleukin 11, Current Protocols in Immunology. J.E. Coligan et al. Eds. Vol 1
pp. 6.15.1 John Wiley and Sons, Toronto. 1991; Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K. J., Measurement of Mouse and Human Interleukin 9 Current Protocols in 
Immunology. J.E. Coligan et al., Eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 
1991.

The proteins encoded by the cDNAs may also be assayed for their ability to
regulate T-cell responses to antigens. Many assays for such activity are familiar to those 
skilled in the art, including the assays described in the following references, which are 
incorporated herein by reference: Chapter 3 (In Vitro Assays for Mouse Lymphocyte 
Function), Chapter 6 (Cytokines and Their Cellular Receptors) and Chapter 7, 
(Immunologic Studies in Humans) in Current Protocols in Immunology, J.E. Coligan et 
al. Eds. Greene Publishing Associates and Wiley-Interscience; Weinberger et al., Proc. 
Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 11:405- 
411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 
140:508-512, 1988.

Those proteins which exhibit cytokine, cell proliferation, or cell differentiation 
activity may then be formulated as pharmaceuticals and used to treat clinical conditions in
which induction of cell proliferation or differentiation is beneficial. Alternatively, as 
described in more detail below, genes encoding these proteins or nucleic acids regulating 
the expression of these proteins may be introduced into appropriate host cells to increase 
or decrease the expression of the proteins as desired. 

EXAMPLE 33 

Assaying the Proteins Expressed from Extended cDNAs or Portions
Thereof for Activity as Immune System Regulators
The proteins encoded by the cDNAs may also be evaluated for their effects as 
immune regulators. For example, the proteins may be evaluated for their activity to 
influence thymocyte or splenocyte cytotoxicity. Numerous assays for such activity are 
familiar to those skilled in the art including the assays described in the following 
references, which are incorporated herein by reference: Chapter 3 (In Vitro Assays for 
Mouse Lymphocyte Function 3.1-3.19) and Chapter 7 (Immunologic studies in Humans) 
in Current Protocols in Immunology, J.E. Coligan et al. Eds., Greene Publishing
Associates and Wiley-Interscience; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488- 
2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai
et al., J. Immunol. 140:508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J.
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986;
Bowman et al., J. Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988;
Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol.
153:3079-3092, 1994.

The proteins encoded by the cDNAs may also be evaluated for their effects on T- 
cell dependent immunoglobulin responses and isotype switching. Numerous assays for 
such activity are familiar to those skilled in the art, including the assays disclosed in the 
following references, which are incorporated herein by reference: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; Mond, J.J. and Brunswick, M., Assays for B Cell
Function: In vitro Antibody Production, Vol 1 pp. 3.8.1-3.8.16 in Current Protocols in
Immunology. J.E. Coligan et al. Eds., John Wiley and Sons, Toronto. 1994.

The proteins encoded by the cDNAs may also be evaluated for their effect on 
immune effector cells, including their effect on Th1 cells and cytotoxic lymphocytes.
Numerous assays for such activity are familiar to those skilled in the art, including the 
assays disclosed in the following references, which are incorporated herein by reference: 
Chapter 3 (In Vitro Assays for Mouse Lymphocyte Function 3.1-3.19) and Chapter 7 
(Immunologic Studies in Humans) in Current Protocols in Immunology, J.E. Coligan et 
al. Eds., Greene Publishing Associates and Wiley-Interscience; Takai et al., J. Immunol.
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J.
Immunol. 149:3778-3783, 1992.

The proteins encoded by the cDNAs may also be evaluated for their effect on 
dendritic cell mediated activation of naive T-cells. Numerous assays for such activity are 
familiar to those skilled in the art, including the assays disclosed in the following 
references, which are incorporated herein by reference: Guery et al., J. Immunol.
134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 173:549-559, 1991;
Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et al., Journal
of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069,
1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical
Investigation 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine
172:631-640, 1990.

The proteins encoded by the cDNAs may also be evaluated for their influence on 
the lifetime of lymphocytes. Numerous assays for such activity are familiar to those 
skilled in the art, including the assays disclosed in the following references, which are
incorporated herein by reference: Darzynkiewicz et al., Cytometry 13:795-808, 1992;
Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 53:1945-
1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al.,
International Journal of Oncology 1:639-648, 1992.

Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al., Blood
84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood
85:2770-2778, 1995; Toki et al., Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991.

Those proteins which exhibit activity as immune system regulators may
then be formulated as pharmaceuticals and used to treat clinical conditions in which 
regulation of immune activity is beneficial. For example, the protein may be useful in the 
treatment of various immune deficiencies and disorders (including severe combined 
immunodeficiency (SCID)), e.g., in regulating (up or down) growth and proliferation of T 
and/or B lymphocytes, as well as affecting the cytolytic activity of NK cells and other cell
populations. These immune deficiencies may be genetic or be caused by viral (e.g., HIV)
as well as bacterial or fungal infections, or may result from autoimmune disorders. More
specifically, infectious diseases caused by viral, bacterial, fungal or other infection may be
treatable using a protein of the present invention, including infections by HIV, hepatitis 
viruses, herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal 
infections such as candidiasis. Of course, in this regard, a protein of the present invention
may also be useful where a boost to the immune system generally may be desirable, i.e., in
the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation,
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitus,
myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease.
Such a protein of the present invention may also be useful in the treatment of allergic
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to regulate immune
responses in a number of ways. Down regulation may be in the form of inhibiting or
blocking an immune response already in progress or may involve preventing the induction 
of an immune response. The functions of activated T-cells may be inhibited by 
suppressing T cell responses or by inducing specific tolerance in T cells, or both. 
Immunosuppression of T cell responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to the suppressive agent.
Tolerance, which involves inducing non-responsiveness or anergy in T cells, is
distinguishable from immunosuppression in that it is generally antigen-specific and
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the 
absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue,
skin and organ transplantation and in graft-versus-host disease (GVHD). For example,
blockage of T cell function should result in reduced tissue destruction in tissue
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated
through its recognition as foreign by T cells, followed by an immune reaction that
destroys the transplant. The administration of a molecule which inhibits or blocks 
interaction of a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as 
a soluble, monomeric form of a peptide having B7-2 activity alone or in conjunction with 
a monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., 
B7-1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this
manner prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
tolerance in a subject, it may also be necessary to block the function of a combination of B
lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as
described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl.
Acad. Sci. USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul
ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used
to determine the effect of blocking B lymphocyte antigen function in vivo on the
development of that disease. 

Blocking antigen function may also be therapeutically useful for treating
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the production
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block costimulation of T cells by disrupting receptor 
ligand interactions of B lymphocyte antigens can be used to inhibit T cell activation and 
prevent production of autoantibodies or T cell-derived cytokines which may be involved
in the disease process. Additionally, blocking reagents may induce antigen-specific 
tolerance of autoreactive T cells which could lead to long-term relief from the disease. 
The efficacy of blocking reagents in preventing or alleviating autoimmune disorders can 
be determined using a number of well-characterized animal models of human autoimmune 
diseases. Examples include murine experimental autoimmune encephalitis, systemic
lupus erythematosus in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen
arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia
gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840- 
856). 

Upregulation of an antigen function (preferably a B lymphocyte antigen function),
as a means of up regulating immune responses, may also be useful in therapy.
Upregulation of immune responses may be in the form of enhancing an existing immune
response or eliciting an initial immune response. For example, enhancing an immune
response through stimulating B lymphocyte antigen function may be useful in cases of
viral infection. In addition, systemic viral diseases such as influenza, the common cold,
and encephalitis might be alleviated by the administration of a stimulatory form of B
lymphocyte antigens systemically.

Alternatively, anti-viral immune responses may be enhanced in an infected patient
by removing T cells from the patient, costimulating the T cells in vitro with viral
antigen-pulsed APCs either expressing a peptide of the present invention or together with a
stimulatory form of a soluble peptide of the present invention and reintroducing the in
vitro activated T cells into the patient. The infected cells would now be capable of
delivering a costimulatory signal to T cells in vivo, thereby activating the T cells.

In another application, upregulation or enhancement of antigen function
(preferably B lymphocyte antigen function) may be useful in the induction of tumor
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma,
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present
invention can be administered to a subject to overcome tumor-specific tolerance in the
subject. If desired, the tumor cell can be transfected to express a combination of peptides.
For example, tumor cells obtained from a patient can be transfected ex vivo with an
expression vector directing the expression of a peptide having B7-2-like activity alone, or
in conjunction with a peptide having B7-1-like activity and/or B7-3-like activity. The
transfected tumor cells are returned to the patient to result in expression of the peptides on
the surface of the transfected cell. Alternatively, gene therapy techniques can be used to
target a tumor cell for transfection in vivo.

The presence of the peptide of the present invention having the activity of a B
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary
costimulation signal to T cells to induce a T cell mediated immune response against the
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II
molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II
molecules, can be transfected with nucleic acids encoding all or a portion (e.g., a
cytoplasmic-domain truncated portion) of an MHC class I α chain protein and β2
microglobulin protein or an MHC class II α chain protein and an MHC class II β chain
protein to thereby express MHC class I or MHC class II proteins on the cell surface.
Expression of the appropriate class I or class II MHC in conjunction with a peptide
having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell
mediated immune response against the transfected tumor cell. Optionally, a gene
encoding an antisense construct which blocks expression of an MHC class II associated
protein, such as the invariant chain, can also be cotransfected with a DNA encoding a
peptide having the activity of a B lymphocyte antigen to promote presentation of tumor
associated antigens and induce tumor specific immunity. Thus, the induction of a T cell
mediated immune response in a human subject may be sufficient to overcome
tumor-specific tolerance in the subject. Alternatively, as described in more detail below, genes
encoding these proteins or nucleic acids regulating the expression of these proteins may be
introduced into appropriate host cells to increase or decrease the expression of the proteins
as desired.

EXAMPLE 34 

Assaying the Proteins Expressed from Extended cDNAs 
or Portions Thereof for Hematopoiesis Regulating Activity 

The proteins encoded by the extended cDNAs or portions thereof may also be 
evaluated for their hematopoiesis regulating activity. For example, the effect of the 
proteins on embryonic stem cell differentiation may be evaluated. Numerous assays for 
such activity are familiar to those skilled in the art, including the assays disclosed in the 
following references, which are incorporated herein by reference: Johansson et al.,
Cellular Biology 15:141-151, 1995; Keller et al., Molecular and Cellular Biology
13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993.

The proteins encoded by the extended cDNAs or portions thereof may also be 
evaluated for their influence on the lifetime of stem cells and stem cell differentiation. 
Numerous assays for such activity are familiar to those skilled in the art, including the 
assays disclosed in the following references, which are incorporated herein by reference: 
Freshney, M.G. Methylcellulose Colony Forming Assays, in Culture of Hematopoietic
Cells. R.I. Freshney, et al. Eds. pp. 265-268, Wiley-Liss, Inc., New York, NY. 1994;
Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; McNiece, I.K. and
Briddell, R.A. Primitive Hematopoietic Colony Forming Cells with High Proliferative
Potential, in Culture of Hematopoietic Cells. R.I. Freshney, et al. Eds. Vol pp. 23-39,
Wiley-Liss, Inc., New York, NY. 1994; Neben et al., Experimental Hematology
22:353-359, 1994; Ploemacher, R.E. Cobblestone Area Forming Cell Assay, in Culture of
Hematopoietic Cells. R.I. Freshney, et al. Eds. pp. 1-21, Wiley-Liss, Inc., New York,
NY. 1994; Spooncer, E., Dexter, M. and Allen, T. Long Term Bone Marrow Cultures in
the Presence of Stromal Cells, in Culture of Hematopoietic Cells. R.I. Freshney, et al.
Eds. pp. 163-179, Wiley-Liss, Inc., New York, NY. 1994; and Sutherland, H.J. Long
Term Culture Initiating Cell Assay, in Culture of Hematopoietic Cells. R.I. Freshney, et
al. Eds. pp. 139-162, Wiley-Liss, Inc., New York, NY. 1994.

Those proteins which exhibit hematopoiesis regulatory activity may then be
formulated as pharmaceuticals and used to treat clinical conditions in which regulation of
hematopoiesis is beneficial. For example, a protein of the present invention may be useful
in regulation of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid
cell deficiencies. Even marginal biological activity in support of colony forming cells or
of factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in
supporting the growth and proliferation of erythroid progenitor cells alone or in
combination with other cytokines, thereby indicating utility, for example, in treating
various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the
production of erythroid precursors and/or erythroid cells; in supporting the growth and
proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e.,
traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent
or treat consequent myelo-suppression; in supporting the growth and proliferation of
megakaryocytes and consequently of platelets thereby allowing prevention or treatment of
various platelet disorders such as thrombocytopenia, and generally for use in place of or
complementary to platelet transfusions; and/or in supporting the growth and proliferation
of hematopoietic stem cells which are capable of maturing to any and all of the
above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell
disorders (such as those usually treated with transplantation, including, without limitation,
aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in repopulating the
stem cell compartment post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in
conjunction with bone marrow transplantation or with peripheral progenitor cell
transplantation (homologous or heterologous)) as normal cells or genetically manipulated
for gene therapy. Alternatively, as described in more detail below, genes encoding these
proteins or nucleic acids regulating the expression of these proteins may be introduced
into appropriate host cells to increase or decrease the expression of the proteins as desired.

EXAMPLE 35

Assaying the Proteins Expressed from Extended cDNAs or Portions Thereof for
Regulation of Tissue Growth

The proteins encoded by the extended cDNAs or portions thereof may also be
evaluated for their effect on tissue growth. Numerous assays for such activity are familiar
to those skilled in the art, including the assays disclosed in International Patent Publication
No. WO95/16035, International Patent Publication No. WO95/05846 and International
Patent Publication No. WO91/07491, which are incorporated herein by reference.

Assays for wound healing activity include, without limitation, those described in:
Winter, Epidermal Wound Healing, pp. 71-112 (Maibach, H.I. and Rovee, D.T., eds.), Year
Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest.
Dermatol. 71:382-84 (1978), which are incorporated herein by reference.

Those proteins which are involved in the regulation of tissue growth may then be 
formulated as pharmaceuticals and used to treat clinical conditions in which regulation of 
tissue growth is beneficial. For example, a protein of the present invention also may have 
utility in compositions used for bone, cartilage, tendon, ligament and/or nerve tissue
growth or regeneration, as well as for wound healing and tissue repair and replacement,
and in the treatment of burns, incisions and ulcers.

A protein of the present invention, which induces cartilage and/or bone growth in
circumstances where bone is not normally formed, has application in the healing of bone
fractures and cartilage damage or defects in humans and other animals. Such a
preparation employing a protein of the invention may have prophylactic use in closed as
well as open fracture reduction and also in the improved fixation of artificial joints. De
novo bone formation induced by an osteogenic agent contributes to the repair of
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also
is useful in cosmetic plastic surgery.

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment to 
attract bone-forming cells, stimulate growth of bone-forming cells or induce 
differentiation of progenitors of bone-forming cells. A protein of the invention may also
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation of
bone and/or cartilage repair or by blocking inflammation or processes of tissue destruction
(collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes.

Another category of tissue regeneration activity that may be attributable to the
protein of the present invention is tendon/ligament formation. A protein of the present
invention, which induces tendon/ligament-like tissue or other tissue formation in
circumstances where such tissue is not normally formed, has application in the healing of
tendon or ligament tears, deformities and other tendon or ligament defects in humans and
other animals. Such a preparation employing a tendon/ligament-like tissue inducing
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as
well as use in the improved fixation of tendon or ligament to bone or other tissues, and in
repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue
formation induced by a composition of the present invention contributes to the repair of
congenital, trauma induced, or other tendon or ligament defects of other origin, and is also
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The
compositions of the present invention may provide an environment to attract tendon- or
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce
differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The
compositions of the invention may also be useful in the treatment of tendinitis, carpal
tunnel syndrome and other tendon or ligament defects. The compositions may also
include an appropriate matrix and/or sequestering agent as a carrier as is well known in the
art.

The protein of the present invention may also be useful for proliferation of neural
cells and for regeneration of nerve and brain tissue, i.e., for the treatment of central and
peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic
disorders, which involve degeneration, death or trauma to neural cells or nerve tissue.
More specifically, a protein may be used in the treatment of diseases of the peripheral
nervous system, such as peripheral nerve injuries, peripheral neuropathy and localized
neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's
disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome.
Further conditions which may be treated in accordance with the present invention include
mechanical and traumatic disorders, such as spinal cord disorders, head trauma and
cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from
chemotherapy or other medical therapies may also be treatable using a protein of the
invention.

Proteins of the invention may also be useful to promote better or faster closure of
non-healing wounds, including without limitation pressure ulcers, ulcers associated with
vascular insufficiency, surgical and traumatic wounds, and the like.

It is expected that a protein of the present invention may also exhibit activity for
generation or regeneration of other tissues, such as organs (including, for example,
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac)
and vascular (including vascular endothelium) tissue, or for promoting the growth of cells
comprising such tissues. Part of the desired effects may be by inhibition or modulation of
fibrotic scarring to allow normal tissue to generate. A protein of the invention may also
exhibit angiogenic activity.

A protein of the present invention may also be useful for gut protection or
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues,
and conditions resulting from systemic cytokine damage.

A protein of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

Alternatively, as described in more detail below, genes encoding these proteins or
nucleic acids regulating the expression of these proteins may be introduced into
appropriate host cells to increase or decrease the expression of the proteins as desired.

EXAMPLE 36 

Assaying the Proteins Expressed from Extended cDNAs or Portions 
Thereof for Regulation of Reproductive Hormones or Cell Movement

The proteins encoded by the extended cDNAs or portions thereof may also be
evaluated for their ability to regulate reproductive hormones, such as follicle stimulating
hormone. Numerous assays for such activity are familiar to those skilled in the art,
including the assays disclosed in the following references, which are incorporated herein
by reference: Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature
321:779-782, 1986; Vale et al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663,
1985; Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986; Chapter 6.12
(Measurement of Alpha and Beta Chemokines), Current Protocols in Immunology, J.E.
Coligan et al. Eds. Greene Publishing Associates and Wiley-Interscience; Taub et al., J.
Clin. Invest. 95:1370-1376, 1995; Lind et al., APMIS 103:140-146, 1995; Muller et al.,
Eur. J. Immunol. 25:1744-1748; Gruber et al., J. of Immunol. 152:5860-5867, 1994;
Johnston et al., J. of Immunol. 153:1762-1768, 1994.

Those proteins which exhibit activity as reproductive hormones or regulators of
cell movement may then be formulated as pharmaceuticals and used to treat clinical
conditions in which regulation of reproductive hormones or cell movement is beneficial.
For example, a protein of the present invention may also exhibit activin- or
inhibin-related activities. Inhibins are characterized by their ability to inhibit the release of follicle
stimulating hormone (FSH), while activins are characterized by their ability to stimulate
the release of follicle stimulating hormone (FSH). Thus, a protein of the present invention,
alone or in heterodimers with a member of the inhibin α family, may be useful as a
contraceptive based on the ability of inhibins to decrease fertility in female mammals and
decrease spermatogenesis in male mammals. Administration of sufficient amounts of
other inhibins can induce infertility in these mammals. Alternatively, the protein of the
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin-β
group, may be useful as a fertility inducing therapeutic, based upon the ability of activin
molecules to stimulate FSH release from cells of the anterior pituitary. See, for
example, United States Patent 4,798,885, the disclosure of which is incorporated herein by
reference. A protein of the invention may also be useful for advancement of the onset of
fertility in sexually immature mammals, so as to increase the lifetime reproductive
performance of domestic animals such as cows, sheep and pigs.

Alternatively, as described in more detail below, genes encoding these proteins or
nucleic acids regulating the expression of these proteins may be introduced into
appropriate host cells to increase or decrease the expression of the proteins as desired.

EXAMPLE 36A 
Assaying the Proteins Expressed from Extended cDNAs or
Portions Thereof for Chemotactic/Chemokinetic Activity

The proteins encoded by the extended cDNAs or portions thereof may also be evaluated
for chemotactic/chemokinetic activity. For example, a protein of the present invention
may have chemotactic or chemokinetic activity (e.g., act as a chemokine) for mammalian
cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, mast cells,
eosinophils, epithelial and/or endothelial cells. Chemotactic and chemokinetic proteins can
be used to mobilize or attract a desired cell population to a desired site of action.
Chemotactic or chemokinetic proteins provide particular advantages in treatment of
wounds and other trauma to tissues, as well as in treatment of localized infections. For
example, attraction of lymphocytes, monocytes or neutrophils to tumors or sites of
infection may result in improved immune responses against the tumor or infecting agent.

A protein or peptide has chemotactic activity for a particular cell population if it
can stimulate, directly or indirectly, the directed orientation or movement of such cell
population. Preferably, the protein or peptide has the ability to directly stimulate directed
movement of cells. Whether a particular protein has chemotactic activity for a population
of cells can be readily determined by employing such protein or peptide in any known
assay for cell chemotaxis.

The activity of a protein of the invention may, among other means, be measured
by the following methods:

Assays for chemotactic activity (which will identify proteins that induce or prevent
chemotaxis) consist of assays that measure the ability of a protein to induce the migration
of cells across a membrane as well as the ability of a protein to induce the adhesion of
one cell population to another cell population. Suitable assays for movement and
adhesion include, without limitation, those described in: Current Protocols in
Immunology, Ed. by J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.
Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12,
Measurement of alpha and beta Chemokines 6.12.1-6.12.28); Taub et al., J. Clin. Invest.
95:1370-1376, 1995; Lind et al., APMIS 103:140-146, 1995; Mueller et al., Eur. J.
Immunol. 25:1744-1748; Gruber et al., J. of Immunol. 152:5860-5867, 1994; Johnston et
al., J. of Immunol. 153:1762-1768, 1994.
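
As a purely illustrative aid, the readout of a membrane migration assay of the kind referred to above can be summarized as a chemotactic index, i.e., the number of cells migrating across the membrane toward the test protein divided by the number migrating toward buffer alone. The following Python sketch assumes that simple ratio; the function name, the cell counts and the idea of comparing the index to 1 are hypothetical and are not taken from the cited references.

```python
def chemotactic_index(migrated_with_protein, migrated_with_buffer):
    """Ratio of cells crossing the membrane toward test protein vs. buffer control.

    Both arguments are cell counts from parallel wells (hypothetical inputs).
    An index above 1 suggests induced migration; below 1 suggests inhibition.
    """
    if migrated_with_buffer <= 0:
        raise ValueError("control well must contain at least one migrated cell")
    return migrated_with_protein / migrated_with_buffer


# Hypothetical counts: 240 cells migrated toward the protein, 80 toward buffer.
index = chemotactic_index(240, 80)
print(f"chemotactic index = {index:.1f}")
```

A ratio is used here only because it normalizes for spontaneous migration in the control well; other summary statistics may be equally appropriate for a given assay.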

EXAMPLE 37 

Assaying the Proteins Expressed from Extended cDNAs or
Portions Thereof for Regulation of Blood Clotting

The proteins encoded by the extended cDNAs or portions thereof may also be
evaluated for their effects on blood clotting. Numerous assays for such activity are
familiar to those skilled in the art, including the assays disclosed in the following
references, which are incorporated herein by reference: Linet et al., J. Clin. Pharmacol.
26:131-140, 1986; Burdick et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al.,
Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467-474, 1988.

Those proteins which are involved in the regulation of blood clotting may then be
formulated as pharmaceuticals and used to treat clinical conditions in which regulation of
blood clotting is beneficial. For example, a protein of the invention may also exhibit
hemostatic or thrombolytic activity. As a result, such a protein is expected to be useful in
treatment of various coagulation disorders (including hereditary disorders, such as
hemophilias) or to enhance coagulation and other hemostatic events in treating wounds
resulting from trauma, surgery or other causes. A protein of the invention may also be
useful for dissolving or inhibiting formation of thromboses and for treatment and
prevention of conditions resulting therefrom (such as, for example, infarction of cardiac
and central nervous system vessels (e.g., stroke)). Alternatively, as described in more
detail below, genes encoding these proteins or nucleic acids regulating the expression of
these proteins may be introduced into appropriate host cells to increase or decrease the
expression of the proteins as desired.

EXAMPLE 38 

Assaying the Proteins Expressed from Extended cDNAs or
Portions Thereof for Involvement in Receptor/Ligand Interactions

The proteins encoded by the extended cDNAs or a portion thereof may also be 
evaluated for their involvement in receptor/ligand interactions. Numerous assays for such 
involvement are familiar to those skilled in the art, including the assays disclosed in the 
following references, which are incorporated herein by reference: Chapter 7.28
(Measurement of Cellular Adhesion under Static Conditions 7.28.1-7.28.22) in Current
Protocols in Immunology, J.E. Coligan et al. Eds. Greene Publishing Associates and
Wiley-Interscience; Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer
et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160,
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell
80:661-670, 1995; Gyuris et al., Cell 75:791-803, 1993.

For example, the proteins of the present invention may also demonstrate activity as
receptors, receptor ligands or inhibitors or agonists of receptor/ligand interactions.
Examples of such receptors and ligands include, without limitation, cytokine receptors and
their ligands, receptor kinases and their ligands, receptor phosphatases and their ligands,
receptors involved in cell-cell interactions and their ligands (including without limitation,
cellular adhesion molecules (such as selectins, integrins and their ligands) and
receptor/ligand pairs involved in antigen presentation, antigen recognition and
development of cellular and humoral immune responses). Receptors and ligands are also
useful for screening of potential peptide or small molecule inhibitors of the relevant
receptor/ligand interaction. A protein of the present invention (including, without
limitation, fragments of receptors and ligands) may itself be useful as an inhibitor of
receptor/ligand interactions.

EXAMPLE 38A 

Assaying the Proteins Expressed from Extended cDNAs or Portions
Thereof for Anti-Inflammatory Activity

The proteins encoded by the extended cDNAs or a portion thereof may also be
evaluated for anti-inflammatory activity. The anti-inflammatory activity may be achieved
by providing a stimulus to cells involved in the inflammatory response, by inhibiting or
promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or
promoting chemotaxis of cells involved in the inflammatory process, by inhibiting or
promoting cell extravasation, or by stimulating or suppressing production of other factors
which more directly inhibit or promote an inflammatory response. Proteins exhibiting
such activities can be used to treat inflammatory conditions (including chronic or acute
conditions), including without limitation inflammation associated with infection (such as
septic shock, sepsis or systemic inflammatory response syndrome (SIRS)),
ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute
rejection, nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel
disease, Crohn's disease or resulting from over production of cytokines such as TNF or
IL-1. Proteins of the invention may also be useful to treat anaphylaxis and hypersensitivity to
an antigenic substance or material.

EXAMPLE 38B 

Assaying the Proteins Expressed from Extended cDNAs or 
Portions Thereof for Tumor Inhibition Activity 

The proteins encoded by the extended cDNAs or a portion thereof may also be
evaluated for tumor inhibition activity. In addition to the activities described above for
immunological treatment or prevention of tumors, a protein of the invention may exhibit
other anti-tumor activities. A protein may inhibit tumor growth directly or indirectly
(such as, for example, via ADCC). A protein may exhibit its tumor inhibitory activity by
acting on tumor tissue or tumor precursor tissue, by inhibiting formation of tissues
necessary to support tumor growth (such as, for example, by inhibiting angiogenesis), by
causing production of other factors, agents or cell types which inhibit tumor growth, or by
suppressing, eliminating or inhibiting factors, agents or cell types which promote tumor
growth.

A protein of the invention may also exhibit one or more of the following
additional activities or effects: inhibiting the growth, infection or function of, or killing,
infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites;
affecting (suppressing or enhancing) bodily characteristics, including, without limitation,
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or
organ or body part size or shape (such as, for example, breast augmentation or diminution,
change in bone form or shape); affecting biorhythms or circadian cycles or rhythms;
affecting the fertility of male or female subjects; affecting the metabolism, catabolism,
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein,
carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s);
affecting behavioral characteristics, including, without limitation, appetite, libido, stress,
cognition (including cognitive disorders), depression (including depressive disorders) and
violent behaviors; providing analgesic effects or other pain reducing effects; promoting
differentiation and growth of embryonic stem cells in lineages other than hematopoietic
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for
example, the ability to bind antigens or complement); and the ability to act as an antigen in
a vaccine composition to raise an immune response against such protein or another
material or entity which is cross-reactive with such protein.

EXAMPLE 39 
Identification of Proteins which Interact with
Polypeptides Encoded by Extended cDNAs

Proteins which interact with the polypeptides encoded by extended cDNAs or
portions thereof, such as receptor proteins, may be identified using two hybrid systems
such as the Matchmaker Two Hybrid System 2 (Catalog No. K1604-1, Clontech). As
described in the manual accompanying the Matchmaker Two Hybrid System 2 (Catalog
No. K1604-1, Clontech), which is incorporated herein by reference, the extended cDNAs
or portions thereof are inserted into an expression vector such that they are in frame with
DNA encoding the DNA binding domain of the yeast transcriptional activator GAL4.
cDNAs in a cDNA library which encode proteins which might interact with the
polypeptides encoded by the extended cDNAs or portions thereof are inserted into a
second expression vector such that they are in frame with DNA encoding the activation
domain of GAL4. The two expression plasmids are transformed into yeast and the yeast
are plated on selection medium which selects for expression of selectable markers on each
of the expression vectors as well as GAL4 dependent expression of the HIS3 gene.
Transformants capable of growing on medium lacking histidine are screened for GAL4
dependent lacZ expression. Those cells which are positive in both the histidine selection
and the lacZ assay contain plasmids encoding proteins which interact with the polypeptide
encoded by the extended cDNAs or portions thereof.
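
The scoring rule of the two hybrid screen, in which a clone is retained only if it is positive in both the histidine selection and the lacZ assay, can be written out as a short sketch. The Python below is illustrative only: the `Transformant` record, the clone identifiers and the boolean readouts are hypothetical and form no part of the Matchmaker system or its manual.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transformant:
    """One yeast colony from the screen (hypothetical record layout)."""
    library_clone: str        # identifier of the activation-domain library cDNA
    grows_without_his: bool   # growth on medium lacking histidine (HIS3 reporter)
    lacz_positive: bool       # GAL4 dependent lacZ expression detected


def candidate_interactors(colonies):
    """Return library clones positive in both the HIS3 selection and the lacZ assay."""
    return [c.library_clone for c in colonies
            if c.grows_without_his and c.lacz_positive]


# Hypothetical screen results.
colonies = [
    Transformant("clone_A", True, True),
    Transformant("clone_B", True, False),   # HIS3 only: not scored as an interactor
    Transformant("clone_C", False, False),
]
print(candidate_interactors(colonies))
```

Requiring both reporters reflects the text above; a clone passing only one of the two readouts is treated here as a likely false positive.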

Alternatively, the system described in Lustig et al., Methods in Enzymology 283:
83-99 (1997), the disclosure of which is incorporated herein by reference, may be used for
identifying molecules which interact with the polypeptides encoded by extended cDNAs.
In such systems, in vitro transcription reactions are performed on a pool of vectors
containing extended cDNA inserts cloned downstream of a promoter which drives in vitro
transcription. The resulting pools of mRNAs are introduced into Xenopus laevis oocytes.
The oocytes are then assayed for a desired activity.

Alternatively, the pooled in vitro transcription products produced as described
above may be translated in vitro. The pooled in vitro translation products can be assayed
for a desired activity or for interaction with a known polypeptide.

Proteins or other molecules interacting with polypeptides encoded by extended
cDNAs can be found by a variety of additional techniques. In one method, affinity
columns containing the polypeptide encoded by the extended cDNA or a portion thereof
can be constructed. In some versions of this method, the affinity column contains
chimeric proteins in which the protein encoded by the extended cDNA or a portion
thereof is fused to glutathione S-transferase. A mixture of cellular proteins or pool of
expressed proteins as described above is applied to the affinity column. Proteins
interacting with the polypeptide attached to the column can then be isolated and
analyzed on 2-D electrophoresis gel as described in Ramunsen et al., Electrophoresis,
18, 588-598 (1997), the disclosure of which is incorporated herein by reference.
Alternatively, the proteins retained on the affinity column can be purified by
electrophoresis based methods and sequenced. The same method can be used to isolate
antibodies, to screen phage display products, or to screen phage display human
antibodies.

Proteins interacting with polypeptides encoded by extended cDNAs or portions 
thereof can also be screened by using an Optical Biosensor as described in Edwards & 
Leatherbarrow, Analytical Biochemistry, 246, 1-6 (1997), the disclosure of which is
incorporated herein by reference. The main advantage of the method is that it allows
the determination of the association rate between the protein and other interacting
molecules. Thus, it is possible to specifically select interacting molecules with a high or
low association rate. Typically a target molecule is linked to the sensor surface
(through a carboxymethyl dextran matrix) and a sample of test molecules is placed in
contact with the target molecules. The binding of a test molecule to the target molecule
causes a change in the refractive index and/or thickness. This change is detected by the
Biosensor provided it occurs in the evanescent field (which extends a few hundred
nanometers from the sensor surface). In these screening assays, the target molecule can
be one of the polypeptides encoded by extended cDNAs or a portion thereof and the test
sample can be a collection of proteins extracted from tissues or cells, a pool of
expressed proteins, combinatorial peptide and/or chemical libraries, or phage displayed
peptides. The tissues or cells from which the test proteins are extracted can originate
from any species.
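
By way of illustration only, the determination of an association rate from biosensor data may be sketched as follows. The sketch assumes a simple 1:1 binding model, which the text above does not specify; the function names, parameter names, and numerical values are hypothetical and are not taken from the cited reference.

```python
import math

def response(t, conc, ka, kd, rmax):
    """Sensor response at time t for an assumed 1:1 binding model.

    conc: concentration of the test molecule (M)
    ka:   association rate constant (1/(M*s))
    kd:   dissociation rate constant (1/s)
    rmax: response when every immobilized target is occupied
    """
    kobs = ka * conc + kd               # observed rate constant
    req = rmax * ka * conc / kobs       # equilibrium response at this concentration
    return req * (1.0 - math.exp(-kobs * t))

def ka_from_kobs(c1, kobs1, c2, kobs2):
    """Estimate ka as the slope of kobs versus concentration (two-point form)."""
    return (kobs2 - kobs1) / (c2 - c1)

# Hypothetical constants, chosen only to exercise the functions.
ka_true, kd_true = 1.0e5, 1.0e-3
c_low, c_high = 1.0e-8, 5.0e-8
ka_est = ka_from_kobs(c_low, ka_true * c_low + kd_true,
                      c_high, ka_true * c_high + kd_true)
```

Under this assumed model, test molecules may be ranked by the estimated value of ka in order to select those with a high or low association rate.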

In other methods, a target protein is immobilized and the test population is a 
collection of unique polypeptides encoded by the extended cDNAs or portions thereof.

To study the interaction of the proteins encoded by the extended cDNAs or
portions thereof with drugs, the microdialysis coupled to HPLC method described by
Wang et al., Chromatographia, 44, 205-208 (1997) or the affinity capillary
electrophoresis method described by Busch et al., J. Chromatogr. 777:311-328 (1997),
the disclosures of which are incorporated herein by reference, can be used.

The system described in U.S. Patent No. 5,654,150, the disclosure of which is 
incorporated herein by reference, may also be used to identify molecules which interact
with the polypeptides encoded by the extended cDNAs. In this system, pools of extended
cDNAs are transcribed and translated in vitro and the reaction products are assayed for 
interaction with a known polypeptide or antibody. 

It will be appreciated by those skilled in the art that the proteins expressed from 
the extended cDNAs or portions thereof may be assayed for numerous activities in addition to
those specifically enumerated above. For example, the expressed proteins may be
evaluated for applications involving control and regulation of inflammation, tumor
proliferation or metastasis, infection, or other clinical conditions. In addition, the proteins 
expressed from the extended cDNAs or portions thereof may be useful as nutritional 
agents or cosmetic agents. 

The proteins expressed from the extended cDNAs or portions thereof may be used
to generate antibodies capable of specifically binding to the expressed protein or
fragments thereof as described in Example 40 below. The antibodies may be capable of
binding a full length protein encoded by one of the sequences of SEQ ID NOs. 40-140 and
242-377, a mature protein encoded by one of the sequences of SEQ ID NOs. 40-140 and
242-377, or a signal peptide encoded by one of the sequences of SEQ ID NOs. 40-140 and
242-377. Alternatively, the antibodies may be capable of binding fragments of the
proteins expressed from the extended cDNAs which comprise at least 10 amino acids of
the sequences of SEQ ID NOs: 141-241 and 378-513. In some embodiments, the
antibodies may be capable of binding fragments of the proteins expressed from the
extended cDNAs which comprise at least 15 amino acids of the sequences of SEQ ID
NOs: 141-241 and 378-513. In other embodiments, the antibodies may be capable of
binding fragments of the proteins expressed from the extended cDNAs which comprise at
least 25 amino acids of the sequences of SEQ ID NOs: 141-241 and 378-513. In further
embodiments, the antibodies may be capable of binding fragments of the proteins
expressed from the extended cDNAs which comprise at least 40 amino acids of the
sequences of SEQ ID NOs: 141-241 and 378-513.

EXAMPLE 40 
Production of an Antibody to a Human Protein 
Substantially pure protein or polypeptide is isolated from the transfected or 
transformed cells as described in Example 30. The concentration of protein in the final
preparation is adjusted, for example, by concentration on an Amicon filter device, to the 
level of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can then 
be prepared as follows: 

A. Monoclonal Antibody Production by Hybridoma Fusion

Monoclonal antibody to epitopes of any of the peptides identified and isolated as 
described can be prepared from murine hybridomas according to the classical method of 
Kohler, G. and Milstein, C., Nature 256:495 (1975) or derivative methods thereof.
Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein 
or peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed, 
and the antibody producing cells of the spleen isolated. The spleen cells are fused by 
means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells 
destroyed by growth of the system on selective media comprising aminopterin (HAT 
media). The successfully fused cells are diluted and aliquots of the dilution placed in 
wells of a microtiter plate where growth of the culture is continued. Antibody-producing 
clones are identified by detection of antibody in the supernatant fluid of the wells by 
immunoassay procedures, such as ELISA, as originally described by Engvall, E., Meth.
Enzymol. 70:419 (1980), and derivative methods thereof. Selected positive clones can be
expanded and their monoclonal antibody product harvested for use. Detailed procedures 
for monoclonal antibody production are described in Davis, L. et al. Basic Methods in 
Molecular Biology Elsevier, New York. Section 21-2. 
B. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a single 
protein can be prepared by immunizing suitable animals with the expressed protein or
peptides derived therefrom described above, which can be unmodified or modified to
enhance immunogenicity. Effective polyclonal antibody production is affected by many
factors related both to the antigen and the host species. For example, small molecules tend
to be less immunogenic than others and may require the use of carriers and adjuvant.
Also, host animals vary in response to site of inoculations and dose, with either inadequate
or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of
antigen administered at multiple intradermal sites appear to be most reliable. An
effective immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin.
Endocrinol. Metab. 33:988-991 (1971).

Booster injections can be given at regular intervals, and antiserum harvested when
antibody titer thereof, as determined semi-quantitatively, for example, by double
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See,
for example, Ouchterlony, O. et al. Chap. 19 in: Handbook of Experimental
Immunology D. Wier (ed) Blackwell (1973). Plateau concentration of antibody is usually
in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the antisera for the
antigen is determined by preparing competitive binding curves, as described, for example, 
by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and 
Friedman, Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980). 

Antibody preparations prepared according to either protocol are useful in
quantitative immunoassays which determine concentrations of antigen-bearing substances
in biological samples; they are also used semi-quantitatively or qualitatively to identify the 
presence of antigen in a biological sample. The antibodies may also be used in therapeutic 
compositions for killing cells expressing the protein or reducing the levels of the protein in 
the body. 

V. Use of Extended cDNAs or Portions Thereof as Reagents

The extended cDNAs of the present invention may be used as reagents in isolation 
procedures, diagnostic assays, and forensic procedures. For example, sequences from the 
extended cDNAs (or genomic DNAs obtainable therefrom) may be detectably labeled and 
used as probes to isolate other sequences capable of hybridizing to them. In addition,
sequences from the extended cDNAs (or genomic DNAs obtainable therefrom) may be
used to design PCR primers to be used in isolation, diagnostic, or forensic procedures.

EXAMPLE 41 
Preparation of PCR Primers and Amplification of DNA

The extended cDNAs (or genomic DNAs obtainable therefrom) may be used to
prepare PCR primers for a variety of applications, including isolation procedures for
cloning nucleic acids capable of hybridizing to such sequences, diagnostic techniques and
forensic techniques. The PCR primers are at least 10 bases, and preferably at least 12, 15,
or 17 bases in length. More preferably, the PCR primers are at least 20-30 bases in length.
In some embodiments, the PCR primers may be more than 30 bases in length. It is
preferred that the primer pairs have approximately the same G/C ratio, so that melting
temperatures are approximately the same. A variety of PCR techniques are familiar to
those skilled in the art. For a review of PCR technology, see Molecular Cloning to
Genetic Engineering, White, B.A. Ed. in Methods in Molecular Biology 67: Humana
Press, Totowa 1997. In each of these PCR procedures, PCR primers on either side of the
nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid
sample along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu
polymerase, or Vent polymerase. The nucleic acid in the sample is denatured and the
PCR primers are specifically hybridized to complementary nucleic acid sequences in the
sample. The hybridized primers are extended. Thereafter, another cycle of denaturation,
hybridization, and extension is initiated. The cycles are repeated multiple times to 
produce an amplified fragment containing the nucleic acid sequence between the primer 
sites. 
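
By way of illustration only, the preference that a primer pair have approximately the same G/C ratio and melting temperature may be checked as sketched below. The sketch assumes the common rule-of-thumb estimate of 2°C per A or T and 4°C per G or C, which is suited only to short oligonucleotides and is not prescribed by this example; the function names, the 4°C tolerance, and the primer sequences are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of bases in the primer that are G or C."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)

def rule_of_thumb_tm(primer: str) -> float:
    """Approximate melting temperature: 2 C per A/T plus 4 C per G/C."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2.0 * at + 4.0 * gc

def primers_compatible(fwd: str, rev: str,
                       min_len: int = 10, max_tm_gap: float = 4.0) -> bool:
    """True if both primers meet the minimum length and have similar Tm."""
    if len(fwd) < min_len or len(rev) < min_len:
        return False
    return abs(rule_of_thumb_tm(fwd) - rule_of_thumb_tm(rev)) <= max_tm_gap

# Hypothetical primers, not taken from the sequence listing.
balanced = primers_compatible("ATGCATGCATGC", "TACGTACGTACG")
unbalanced = primers_compatible("ATGCATGCATGC", "GCGCGCGCGCGC")
```

The default minimum length of 10 bases follows the lower limit stated above; longer primers of 20-30 bases may warrant a more refined melting temperature estimate.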

EXAMPLE 42 

Use of Extended cDNAs as Probes

Probes derived from extended cDNAs or portions thereof (or genomic DNAs 
obtainable therefrom) may be labeled with detectable labels familiar to those skilled in the 
art, including radioisotopes and non-radioactive labels, to provide a detectable probe. The 
detectable probe may be single stranded or double stranded and may be made using
techniques known in the art, including in vitro transcription, nick translation, or kinase
reactions. A nucleic acid sample containing a sequence capable of hybridizing to the
labeled probe is contacted with the labeled probe. If the nucleic acid in the sample is
double stranded, it may be denatured prior to contacting the probe. In some applications,
the nucleic acid sample may be immobilized on a surface such as a nitrocellulose or nylon
membrane. The nucleic acid sample may comprise nucleic acids obtained from a variety 
of sources, including genomic DNA, cDNA libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucleic acids capable of hybridizing to 
the detectable probe include well known techniques such as Southern blotting, Northern
blotting, dot blotting, colony hybridization, and plaque hybridization. In some
applications, the nucleic acid capable of hybridizing to the labeled probe may be cloned 
into vectors such as expression vectors, sequencing vectors, or in vitro transcription 
vectors to facilitate the characterization and expression of the hybridizing nucleic acids in 
the sample. For example, such techniques may be used to isolate and clone sequences in a 
genomic library or cDNA library which are capable of hybridizing to the detectable probe
as described in Example 30 above. 

PCR primers made as described in Example 41 above may be used in forensic
analyses, such as the DNA fingerprinting techniques described in Examples 43-47 below.
Such analyses may utilize detectable probes or primers based on the sequences of the
extended cDNAs isolated using the 5' ESTs (or genomic DNAs obtainable therefrom).

EXAMPLE 43 
Forensic Matching by DNA Sequencing 
In one exemplary method, DNA samples are isolated from forensic specimens of, 
for example, hair, semen, blood or skin cells by conventional methods. A panel of PCR
primers based on a number of the extended cDNAs (or genomic DNAs obtainable
therefrom) is then utilized in accordance with Example 41 to amplify DNA of
approximately 100-200 bases in length from the forensic specimen. Corresponding
sequences are obtained from a test subject. Each of these identification DNAs is then
sequenced using standard techniques, and a simple database comparison determines the
differences, if any, between the sequences from the subject and those from the sample.
Statistically significant differences between the suspect's DNA sequences and those from 
the sample conclusively prove a lack of identity. This lack of identity can be proven, for 
example, with only one sequence. Identity, on the other hand, should be demonstrated 
with a large number of sequences, all matching. Preferably, a minimum of 50 statistically 
identical sequences of 100 bases in length are used to prove identity between the suspect 
and the sample. 
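
By way of illustration only, the database comparison described in this example may be sketched as follows. The sketch treats any base difference at a shared locus as a mismatch, which is a simplification of the statistical comparison referred to above; the function name, the locus labels, and the returned verdict strings are hypothetical.

```python
def compare_panels(subject: dict, specimen: dict,
                   min_loci: int = 50, min_len: int = 100):
    """Compare sequenced loci from a subject and a forensic specimen.

    Returns (verdict, mismatched_loci). A single mismatching locus
    excludes identity; a match requires at least min_loci identical
    sequences, each at least min_len bases long.
    """
    shared = sorted(set(subject) & set(specimen))
    mismatched = [locus for locus in shared
                  if subject[locus].upper() != specimen[locus].upper()]
    if mismatched:
        return "excluded", mismatched
    long_enough = [locus for locus in shared if len(subject[locus]) >= min_len]
    if len(long_enough) >= min_loci:
        return "match", []
    return "inconclusive", []

# Hypothetical 100-base sequences at 50 loci, for demonstration only.
subject_panel = {"locus%d" % i: "ACGT" * 25 for i in range(50)}
specimen_panel = dict(subject_panel)
verdict_same = compare_panels(subject_panel, specimen_panel)

specimen_panel["locus3"] = "T" + subject_panel["locus3"][1:]
verdict_diff = compare_panels(subject_panel, specimen_panel)
```

The default thresholds of 50 sequences and 100 bases follow the preferred minimum stated above.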

EXAMPLE 44 

Positive Identification by DNA Sequencing 

The technique outlined in the previous example may also be used on a larger scale 
to provide a unique fingerprint-type identification of any individual. In this technique, 
primers are prepared from a large number of sequences from Table IV and the appended 
sequence listing. Preferably, 20 to 50 different primers are used. These primers are used 
to obtain a corresponding number of PCR-generated DNA segments from the individual 
in question in accordance with Example 41. Each of these DNA segments is sequenced,
using the methods set forth in Example 43. The database of sequences generated through 
this procedure uniquely identifies the individual from whom the sequences were obtained. 
The same panel of primers may then be used at any later time to absolutely correlate 
tissue or other biological specimen with that individual. 

EXAMPLE 45 
Southern Blot Forensic Identification 

The procedure of Example 44 is repeated to obtain a panel of at least 10 amplified 
sequences from an individual and a specimen. Preferably, the panel contains at least 50 
amplified sequences. More preferably, the panel contains 100 amplified sequences. In 
some embodiments, the panel contains 200 amplified sequences. This PCR-generated 
DNA is then digested with one or a combination of, preferably, four base specific 
restriction enzymes. Such enzymes are commercially available and known to those of 
skill in the art. After digestion, the resultant gene fragments are size separated in multiple 
duplicate wells on an agarose gel and transferred to nitrocellulose using Southern blotting
techniques well known to those with skill in the art. For a review of Southern blotting see
Davis et al. (Basic Methods in Molecular Biology, 1986, Elsevier Press, pp 62-65).

A panel of probes based on the sequences of the extended cDNAs (or genomic
DNAs obtainable therefrom), or fragments thereof of at least 10 bases, are radioactively or
colorimetrically labeled using methods known in the art, such as nick translation or end
labeling, and hybridized to the Southern blot using techniques known in the art (Davis et
al., supra). Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides
from the extended cDNA (or genomic DNAs obtainable therefrom). More preferably, the
probe comprises at least 20-30 consecutive nucleotides from the extended cDNA (or
genomic DNAs obtainable therefrom). In some embodiments, the probe comprises more
than 30 nucleotides from the extended cDNA (or genomic DNAs obtainable therefrom).
In other embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100,
at least 150, or at least 200 consecutive nucleotides from the extended cDNA (or genomic
DNAs obtainable therefrom). 

Preferably, at least 5 to 10 of these labeled probes are used, and more preferably at
least about 20 or 30 are used to provide a unique pattern. The resultant bands appearing
from the hybridization of a large sample of extended cDNAs (or genomic DNAs
obtainable therefrom) will be a unique identifier. Since the restriction enzyme cleavage
will be different for every individual, the band pattern on the Southern blot will also be
unique. Increasing the number of extended cDNA probes will provide a statistically
higher level of confidence in the identification since there will be an increased number of 
sets of bands used for identification. 
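
By way of illustration only, the dependence of the band pattern on restriction enzyme cleavage may be sketched as follows. The sketch digests a linear sequence at every occurrence of a four base recognition site and reports fragment sizes; it ignores which fragments a given probe would detect. The recognition site GATC, the cut position, and the two short sequences are hypothetical choices made for demonstration.

```python
def digest(seq: str, site: str, cut_offset: int = 0) -> list:
    """Return fragment lengths after cutting seq at each occurrence of site.

    cut_offset is the position within the site at which the strand is cut.
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]

def band_pattern(seq: str, site: str) -> list:
    """Fragment sizes ordered from largest to smallest, as on a gel."""
    return sorted(digest(seq, site), reverse=True)

# Two hypothetical alleles differing by one base inside the second site.
allele_1 = "AAGATCTTTTGATCCC"
allele_2 = "AAGATCTTTTGTTCCC"
pattern_1 = band_pattern(allele_1, "GATC")
pattern_2 = band_pattern(allele_2, "GATC")
```

A single base difference that destroys a recognition site merges two fragments into one, so the two alleles give different band patterns.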

EXAMPLE 46 

Dot Blot Identification Procedure
Another technique for identifying individuals using the extended cDNA sequences
disclosed herein utilizes a dot blot hybridization technique.

Genomic DNA is isolated from nuclei of the subject to be identified. Oligonucleotide
probes of approximately 30 bp in length are synthesized that correspond to at least 10,
preferably 50 sequences from the extended cDNAs or genomic DNAs obtainable
therefrom. The probes are used to hybridize to the genomic DNA through conditions
known to those in the art. The oligonucleotides are end labeled with 32P using
polynucleotide kinase (Pharmacia). Dot Blots are created by spotting the genomic DNA
onto nitrocellulose or the like using a vacuum dot blot manifold (BioRad, Richmond
California). The genomic sequences are baked or UV linked to the nitrocellulose
filter, which is then prehybridized and hybridized with labeled probe using techniques
known in the art (Davis et al., supra). The 32P labeled DNA fragments are sequentially
hybridized with successively more stringent conditions to detect minimal differences between
the 30 bp sequence and the DNA. Tetramethylammonium chloride is useful for
identifying clones containing small numbers of nucleotide mismatches (Wood et al., Proc.
Natl. Acad. Sci. USA 82(6):1585-1588 (1985)) which is hereby incorporated by
reference. A unique pattern of dots distinguishes one individual from another individual.

Extended cDNAs or oligonucleotides containing at least 10 consecutive bases 
from these sequences can be used as probes in the following alternative fingerprinting
technique. Preferably, the probe comprises at least 12, 15, or 17 consecutive nucleotides
from the extended cDNA (or genomic DNAs obtainable therefrom). More preferably, the
probe comprises at least 20-30 consecutive nucleotides from the extended cDNA (or
genomic DNAs obtainable therefrom). In some embodiments, the probe comprises more
than 30 nucleotides from the extended cDNA (or genomic DNAs obtainable therefrom).
In other embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100,
at least 150, or at least 200 consecutive nucleotides from the extended cDNA (or genomic 
DNAs obtainable therefrom). 

Preferably, a plurality of probes having sequences from different genes are used in 
the alternative fingerprinting technique. Example 47 below provides a representative
alternative fingerprinting procedure in which the probes are derived from extended
cDNAs. 

EXAMPLE 47 

Alternative "Fingerprint" Identification Technique
20-mer oligonucleotides are prepared from a large number, e.g. 50, 100, or 200, of
extended cDNA sequences (or genomic DNAs obtainable therefrom) using commercially
available oligonucleotide services such as Genset, Paris, France. Cell samples from the
test subject are processed for DNA using techniques well known to those with skill in the
art. The nucleic acid is digested with restriction enzymes such as EcoRI and XbaI.
Following digestion, samples are applied to wells for electrophoresis. The procedure, as
known in the art, may be modified to accommodate polyacrylamide electrophoresis;
however, in this example, samples containing 5 µg of DNA are loaded into wells and
separated on 0.8% agarose gels. The gels are transferred onto nitrocellulose using
standard Southern blotting techniques.

10 ng of each of the oligonucleotides are pooled and end-labeled with 32P. The
nitrocellulose is prehybridized with blocking solution and hybridized with the labeled
probes. Following hybridization and washing, the nitrocellulose filter is exposed to
X-Omat AR X-ray film. The resulting hybridization pattern will be unique for each
individual.

It is additionally contemplated within this example that the number of probe 
sequences used can be varied for additional accuracy or clarity. 

The antibodies generated in Examples 30 and 40 above may be used to identify the 
tissue type or cell species from which a sample is derived as described above.

EXAMPLE 48

Identification of Tissue Types or Cell Species by Means of 
Labeled Tissue Specific Antibodies 
Identification of specific tissues is accomplished by the visualization of tissue 
specific antigens by means of antibody preparations according to Examples 30 and 40 
which are conjugated, directly or indirectly, to a detectable marker. Selected labeled
antibody species bind to their specific antigen binding partner in tissue sections, cell
suspensions, or in extracts of soluble proteins from a tissue sample to provide a pattern for
qualitative or semi-qualitative interpretation.

Antisera for these procedures must have a potency exceeding that of the native
preparation, and for that reason, antibodies are concentrated to a mg/ml level by isolation
of the gamma globulin fraction, for example, by ion-exchange chromatography or by
ammonium sulfate fractionation. Also, to provide the most specific antisera, unwanted
antibodies, for example to common proteins, must be removed from the gamma globulin
fraction, for example by means of insoluble immunoabsorbents, before the antibodies are 
labeled with the marker. Either monoclonal or heterologous antisera is suitable for either 
procedure. 

A. Immunohistochemical Techniques

Purified, high-titer antibodies, prepared as described above, are conjugated to a
detectable marker, as described, for example, by Fudenberg, H., Chap. 26 in: Basic &
Clinical Immunology, 3rd Ed. Lange, Los Altos, California (1980) or Rose, N. et al.,
Chap. 12 in: Methods in Immunodiagnosis, 2d Ed. John Wiley & Sons, New York
(1980). 

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies
can also be labeled with an enzyme that supports a color producing reaction with a
substrate, such as horseradish peroxidase. Markers can be added to tissue-bound antibody
in a second step, as described below. Alternatively, the specific antitissue antibodies can
be labeled with ferritin or other electron dense particles, and localization of the ferritin
coupled antigen-antibody complexes achieved by means of an electron microscope. In yet
another approach, the antibodies are radiolabeled, with, for example 125I, and detected by
overlaying the antibody treated preparation with photographic emulsion. 

Preparations to carry out the procedures can comprise monoclonal or polyclonal 
antibodies to a single protein or peptide identified as specific to a tissue type, for example,
brain tissue, or antibody preparations to several antigenically distinct tissue specific
antigens can be used in panels, independently or in mixtures, as required. 

Tissue sections and cell suspensions are prepared for immunohistochemical 
examination according to common histological techniques. Multiple cryostat sections 
(about 4 µm, unfixed) of the unknown tissue and known control, are mounted and each
slide covered with different dilutions of the antibody preparation. Sections of known and
unknown tissues should also be treated with preparations to provide a positive control, a 
negative control, for example, pre-immune sera, and a control for non-specific staining, 
for example, buffer. 

Treated sections are incubated in a humid chamber for 30 min at room
temperature, rinsed, then washed in buffer for 30-45 min. Excess fluid is blotted away,
and the marker developed. 

If the tissue specific antibody was not labeled in the first incubation, it can be 
labeled at this time in a second antibody-antibody reaction, for example, by adding 
fluorescein- or enzyme-conjugated antibody against the immunoglobulin class of the
antiserum-producing species, for example, fluorescein labeled antibody to mouse IgG. 
Such labeled sera are commercially available. 

The antigen found in the tissues by the above procedure can be quantified by 
measuring the intensity of color or fluorescence on the tissue section, and calibrating that 
signal using appropriate standards.

B. Identification of Tissue Specific Soluble Proteins 

The visualization of tissue specific proteins and identification of unknown tissues 
from that procedure is carried out using the labeled antibody reagents and detection 
strategy as described for immunohistochemistry; however the sample is prepared 
according to an electrophoretic technique to distribute the proteins extracted from the
tissue in an orderly array on the basis of molecular weight for detection. 

A tissue sample is homogenized using a Virtis apparatus; cell suspensions are 
disrupted by Dounce homogenization or osmotic lysis, using detergents in either case as
required to disrupt cell membranes, as is the practice in the art. Insoluble cell components
such as nuclei, microsomes, and membrane fragments are removed by ultracentrifugation,
and the soluble protein-containing fraction concentrated if necessary and reserved for 
analysis. 

A sample of the soluble protein solution is resolved into individual protein species 
by conventional SDS polyacrylamide electrophoresis as described, for example, by Davis,
L. et al., Section 19-2 in: Basic Methods in Molecular Biology (P. Leder, ed), Elsevier,
New York (1986), using a range of amounts of polyacrylamide in a set of gels to resolve
the entire molecular weight range of proteins to be detected in the sample. A size marker
is run in parallel for purposes of estimating molecular weights of the constituent proteins.
Sample size for analysis is a convenient volume of from 5 to 55 µl, and containing from
about 1 to 100 µg protein. An aliquot of each of the resolved proteins is transferred by
blotting to a nitrocellulose filter paper, a process that maintains the pattern of resolution.
Multiple copies are prepared. The procedure, known as Western Blot Analysis, is well
described in Davis, L. et al., (above) Section 19-3. One set of nitrocellulose blots is
stained with Coomassie Blue dye to visualize the entire set of proteins for comparison
with the antibody bound proteins. The remaining nitrocellulose filters are then incubated 
with a solution of one or more specific antisera to tissue specific proteins prepared as 
described in Examples 30 and 40. In this procedure, as in procedure A above, appropriate 
positive and negative sample and reagent controls are run. 

In either procedure A or B, a detectable label can be attached to the primary tissue
antigen-primary antibody complex according to various strategies and permutations
thereof. In a straightforward approach, the primary specific antibody can be labeled;
alternatively, the unlabeled complex can be bound by a labeled secondary anti-IgG
antibody. In other approaches, either the primary or secondary antibody is conjugated to a
biotin molecule, which can, in a subsequent step, bind an avidin conjugated marker.
According to yet another strategy, enzyme labeled or radioactive protein A, which has the 
property of binding to any IgG, is bound in a final step to either the primary or secondary 
antibody. 

The visualization of tissue specific antigen binding at levels above those seen in 
control tissues to one or more tissue specific antibodies, prepared from the gene sequences
identified from extended cDNA sequences, can identify tissues of unknown origin, for 
example, forensic samples, or differentiated tumor tissue that has metastasized to foreign 
bodily sites.

In addition to their applications in forensics and identification, extended cDNAs
(or genomic DNAs obtainable therefrom) may be mapped to their chromosomal locations.
Example 49 below describes radiation hybrid (RH) mapping of human chromosomal
regions using extended cDNAs. Example 50 below describes a representative procedure
for mapping an extended cDNA (or a genomic DNA obtainable therefrom) to its location
on a human chromosome. Example 51 below describes mapping of extended cDNAs (or
genomic DNAs obtainable therefrom) on metaphase chromosomes by Fluorescence In 
Situ Hybridization (FISH). 

EXAMPLE 49 

Radiation hybrid mapping of Extended cDNAs to the human genome

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used 
for high resolution mapping of the human genome. In this approach, cell lines containing 
one or more human chromosomes are lethally irradiated, breaking each chromosome into 
fragments whose size depends on the radiation dose. These fragments are rescued by
fusion with cultured rodent cells, yielding subclones containing different portions of the
human genome. This technique is described by Benham et al. (Genomics 4:509-517,
1989) and Cox et al. (Science 250:245-250, 1990), the entire contents of which are hereby
incorporated by reference. The random and independent nature of the subclones permits
efficient mapping of any human genome marker. Human DNA isolated from a panel of
80-100 cell lines provides a mapping reagent for ordering extended cDNAs (or genomic
DNAs obtainable therefrom). In this approach, the frequency of breakage between 
markers is used to measure distance, allowing construction of fine resolution maps as has 
been done using conventional ESTs (Schuler et al., Science 274:540-546, 1996, hereby
incorporated by reference). 
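
By way of illustration only, the use of breakage frequency as a measure of distance may be sketched as follows. The sketch assumes the commonly used two-point estimate in which both markers share an average retention frequency r and the fraction of discordant hybrids equals 2θr(1-r), with distance expressed in centiRays as -100 ln(1-θ). These formulas, the function names, and the retention scores are assumptions made for demonstration and are not data from this application.

```python
import math

def breakage_frequency(a: list, b: list) -> float:
    """Two-point estimate of breakage frequency (theta) between markers.

    a, b: 0/1 retention scores for each marker across the hybrid panel.
    """
    n = len(a)
    if n == 0 or n != len(b):
        raise ValueError("markers must be scored on the same non-empty panel")
    discordant = sum(1 for x, y in zip(a, b) if x != y) / n
    r = (sum(a) + sum(b)) / (2 * n)          # average retention frequency
    if r <= 0.0 or r >= 1.0:
        raise ValueError("retention frequency must lie strictly between 0 and 1")
    return min(discordant / (2 * r * (1 - r)), 1.0)

def centirays(theta: float) -> float:
    """Map distance in centiRays; infinite when the markers appear unlinked."""
    if theta >= 1.0:
        return math.inf
    return -100.0 * math.log(1.0 - theta)

# Hypothetical retention scores for two markers across eight hybrids.
marker_a = [1, 1, 0, 0, 1, 0, 1, 0]
marker_b = [1, 1, 0, 0, 1, 0, 1, 1]
theta = breakage_frequency(marker_a, marker_b)
distance = centirays(theta)
```

In practice a panel of 80-100 cell lines as described above would be scored, and markers would be ordered so as to minimize the total distance between adjacent markers.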

RH mapping has been used to generate a high-resolution whole genome radiation
hybrid map of human chromosome 17q22-q25.3 across the genes for growth hormone
(GH) and thymidine kinase (TK) (Foster et al., Genomics 33:185-192, 1996), the region
surrounding the Gorlin syndrome gene (Obermayr et al., Eur. J. Hum. Genet. 4:242-245,
1996), 60 loci covering the entire short arm of chromosome 12 (Raeymaekers et al.,
Genomics 29:170-178, 1995), the region of human chromosome 22 containing the
neurofibromatosis type 2 locus (Frazer et al., Genomics 14:574-584, 1992) and 13 loci on
the long arm of chromosome 5 (Warrington et al., Genomics 11:701-708, 1991).

EXAMPLE 50 

Mapping of Extended cDNAs to Human
Chromosomes using PCR techniques
Extended cDNAs (or genomic DNAs obtainable therefrom) may be assigned to 
human chromosomes using PCR based methodologies. In such approaches, 
oligonucleotide primer pairs are designed from the extended cDNA sequence (or the
sequence of a genomic DNA obtainable therefrom) to minimize the chance of amplifying
through an intron. Preferably, the oligonucleotide primers are 18-23 bp in length and are
designed for PCR amplification. The creation of PCR primers from known sequences is
well known to those with skill in the art. For a review of PCR technology see Erlich,
H.A., PCR Technology: Principles and Applications for DNA Amplification, 1992,
W.H. Freeman and Co., New York.

The primers are used in polymerase chain reactions (PCR) to amplify templates 
from total human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA 
is used as a template for PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq 
polymerase, and 1 µCi of a ³²P-labeled deoxycytidine triphosphate. The PCR is
performed in a microplate thermocycler (Techne) under the following conditions: 30
cycles of 94°C, 1.4 min; 55°C, 2 min; and 72°C, 2 min; with a final extension at 72°C for
10 min. The amplified products are analyzed on a 6% polyacrylamide sequencing gel and
visualized by autoradiography. If the length of the resulting PCR product is identical to
the distance between the ends of the primer sequences in the extended cDNA from which
the primers are derived, then the PCR reaction is repeated with DNA templates from two
panels of human-rodent somatic cell hybrids, BIOS PCRable DNA (BIOS Corporation) 
and NIGMS Human-Rodent Somatic Cell Hybrid Mapping Panel Number 1 (NIGMS, 
Camden, NJ). 
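
The length comparison described above can be illustrated with a short sketch. It assumes the forward primer matches the cDNA strand exactly and the reverse primer is the reverse complement of a downstream site; the expected product length is then the distance between the outer ends of the two primer sites. The sequences and function names are hypothetical, and mismatches or multiple binding sites are not handled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def expected_amplicon_length(cdna, forward_primer, reverse_primer):
    """Distance in bp between the outer ends of the two primer sites.

    Returns None if either primer has no exact match or if the reverse
    primer site lies upstream of the forward primer site.
    """
    cdna = cdna.upper()
    start = cdna.find(forward_primer.upper())
    rev_site = cdna.find(revcomp(reverse_primer))
    if start < 0 or rev_site < 0 or rev_site < start:
        return None
    return rev_site + len(reverse_primer) - start


# Toy 30-nt "cDNA" with 8-nt primers (real primers would be 18-23 nt).
cdna = "GATTACAGGCTTAACCGGTTCATGCATCGA"
expected = expected_amplicon_length(cdna, "GATTACAG", "TCGATGCA")
```

A product observed on the gel that is longer than the expected value would be consistent with amplification through an intron.
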
PCR is used to screen a series of somatic cell hybrid cell lines containing defined
sets of human chromosomes for the presence of a given extended cDNA (or genomic
DNA obtainable therefrom). DNA is isolated from the somatic hybrids and used as
starting templates for PCR reactions using the primer pairs from the extended cDNAs (or
genomic DNAs obtainable therefrom). Only those somatic cell hybrids with
chromosomes containing the human gene corresponding to the extended cDNA (or
genomic DNA obtainable therefrom) will yield an amplified fragment. The extended
cDNAs (or genomic DNAs obtainable therefrom) are assigned to a chromosome by
analysis of the segregation pattern of PCR products from the somatic hybrid DNA
templates. The single human chromosome present in all cell hybrids that give rise to an
amplified fragment is the chromosome containing that extended cDNA (or genomic DNA
obtainable therefrom). For a review of techniques and analysis of results from somatic
cell gene mapping experiments, see Ledbetter et al., Genomics 6:475-481 (1990).
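
The segregation analysis can be sketched as a set operation. The example below assumes each hybrid line is described by the set of human chromosomes it carries and by a yes/no PCR result; a chromosome is retained as a candidate only if it is present in every PCR-positive hybrid and absent from every PCR-negative hybrid. The panel contents and names are invented for illustration and do not correspond to the BIOS or NIGMS panels.

```python
def assign_chromosome(hybrid_content, pcr_positive):
    """Return the set of chromosomes concordant with the PCR results.

    hybrid_content: dict mapping hybrid name -> set of human chromosomes.
    pcr_positive:   dict mapping hybrid name -> True if a product was seen.
    """
    positives = [h for h in hybrid_content if pcr_positive[h]]
    negatives = [h for h in hybrid_content if not pcr_positive[h]]
    if not positives:
        return set()
    # Chromosomes shared by every hybrid that gave an amplified fragment.
    candidates = set.intersection(*(set(hybrid_content[h]) for h in positives))
    # Discard chromosomes that are also present in a negative hybrid.
    for h in negatives:
        candidates -= set(hybrid_content[h])
    return candidates


panel = {
    "H1": {"1", "7", "12"},
    "H2": {"7", "21"},
    "H3": {"1", "12"},
    "H4": {"7", "12", "X"},
}
results = {"H1": True, "H2": True, "H3": False, "H4": True}
assigned = assign_chromosome(panel, results)
```

In this toy panel only chromosome 7 is present in all positive hybrids, so it is the single candidate returned. An empty or multi-element result would indicate discordant data or a panel that cannot resolve the assignment.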

Alternatively, the extended cDNAs (or genomic DNAs obtainable therefrom) may
be mapped to individual chromosomes using FISH as described in Example 51 below.

EXAMPLE 51
Mapping of Extended 5' ESTs to Chromosomes
Using Fluorescence in situ Hybridization
Fluorescence in situ hybridization allows the extended cDNA (or genomic DNA
obtainable therefrom) to be mapped to a particular location on a given chromosome. The
chromosomes to be used for fluorescence in situ hybridization techniques may be obtained 
from a variety of sources including cell cultures, tissues, or whole blood. 

In a preferred embodiment, chromosomal localization of an extended cDNA (or 
genomic DNA obtainable therefrom) is obtained by FISH as described by Cherif et al. 
(Proc. Natl. Acad. Sci. U.S.A., 87:6639-6643, 1990). Metaphase chromosomes are
prepared from phytohemagglutinin (PHA)-stimulated blood cell donors. PHA-stimulated
lymphocytes from healthy males are cultured for 72 h in RPMI-1640 medium. For
synchronization, methotrexate (10 µM) is added for 17 h, followed by addition of
5-bromodeoxyuridine (5-BudR, 0.1 mM) for 6 h. Colcemid (1 µg/ml) is added for the last
15 min before harvesting the cells. Cells are collected, washed in RPMI, incubated with a
hypotonic solution of KCl (75 mM) at 37°C for 15 min and fixed in three changes of
methanol:acetic acid (3:1). The cell suspension is dropped onto a glass slide and air dried.
The extended cDNA (or genomic DNA obtainable therefrom) is labeled with biotin-16
dUTP by nick translation according to the manufacturer's instructions (Bethesda Research
Laboratories, Bethesda, MD), purified using a Sephadex G-50 column (Pharmacia,
Uppsala, Sweden) and precipitated. Just prior to hybridization, the DNA pellet is
dissolved in hybridization buffer (50% formamide, 2 X SSC, 10% dextran sulfate, 1
mg/ml sonicated salmon sperm DNA, pH 7) and the probe is denatured at 70°C for 5-10
min.

Slides kept at -20°C are treated for 1 h at 37°C with RNase A (100 µg/ml), rinsed
three times in 2 X SSC and dehydrated in an ethanol series. Chromosome preparations are
denatured in 70% formamide, 2 X SSC for 2 min at 70°C, then dehydrated at 4°C. The
slides are treated with proteinase K (10 µg/100 ml in 20 mM Tris-HCl, 2 mM CaCl2) at
37°C for 8 min and dehydrated. The hybridization mixture containing the probe is placed
on the slide, covered with a coverslip, sealed with rubber cement and incubated overnight
in a humid chamber at 37°C. After hybridization and post-hybridization washes, the
biotinylated probe is detected by avidin-FITC and amplified with additional layers of
biotinylated goat anti-avidin and avidin-FITC. For chromosomal localization, fluorescent
R-bands are obtained as previously described (Cherif et al., supra). The slides are
observed under a LEICA fluorescence microscope (DMRXA). Chromosomes are
counterstained with propidium iodide and the fluorescent signal of the probe appears as
two symmetrical yellow-green spots on both chromatids of the fluorescent R-band
chromosome (red). Thus, a particular extended cDNA (or genomic DNA obtainable
therefrom) may be localized to a particular cytogenetic R-band on a given chromosome.

Once the extended cDNAs (or genomic DNAs obtainable therefrom) have been 
assigned to particular chromosomes using the techniques described in Examples 49-51 
above, they may be utilized to construct a high resolution map of the chromosomes on 
which they are located or to identify the chromosomes in a sample. 

EXAMPLE 52

Use of Extended cDNAs to Construct or Expand Chromosome Maps 
Chromosome mapping involves assigning a given unique sequence to a particular
chromosome as described above. Once the unique sequence has been mapped to a given
chromosome, it is ordered relative to other unique sequences located on the same
chromosome. One approach to chromosome mapping utilizes a series of yeast artificial
chromosomes (YACs) bearing several thousand long inserts derived from the
chromosomes of the organism from which the extended cDNAs (or genomic DNAs
obtainable therefrom) are obtained. This approach is described in Ramaiah Nagaraja et al.,
Genome Research 7:210-222, March 1997. Briefly, in this approach each chromosome
is broken into overlapping pieces which are inserted into the YAC vector. The YAC
inserts are screened using PCR or other methods to determine whether they include the
extended cDNA (or genomic DNA obtainable therefrom) whose position is to be
determined. Once an insert has been found which includes the extended cDNA (or
genomic DNA obtainable therefrom), the insert can be analyzed by PCR or other methods
to determine whether the insert also contains other sequences known to be on the
chromosome or in the region from which the extended cDNA (or genomic DNA
obtainable therefrom) was derived. This process can be repeated for each insert in the
YAC library to determine the location of each of the extended cDNAs (or genomic DNAs
obtainable therefrom) relative to one another and to other known chromosomal markers.
In this way, a high resolution map of the distribution of numerous unique markers along
each of the organism's chromosomes may be obtained.
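
The marker-content step of this approach can be sketched in a few lines. The example assumes the screening results are available as a table listing, for each YAC insert, the markers detected on it; markers that share an insert with the sequence of interest are reported together with the inserts that link them. The YAC names and marker names are hypothetical, and ordering the markers along the chromosome would require further analysis of overlapping inserts.

```python
def colocated_markers(yac_content, target):
    """Map each marker found with `target` to the YACs that carry both.

    yac_content: dict mapping YAC name -> set of markers detected on it
                 (for example by PCR screening).
    """
    linked = {}
    for yac, markers in yac_content.items():
        if target not in markers:
            continue
        for marker in markers:
            if marker != target:
                linked.setdefault(marker, []).append(yac)
    return linked


screen = {
    "yA": {"cDNA_X", "STS1"},
    "yB": {"cDNA_X", "STS1", "STS2"},
    "yC": {"STS2", "STS3"},
    "yD": {"STS4"},
}
neighbours = colocated_markers(screen, "cDNA_X")
```

Here the hypothetical cDNA_X shares two inserts with STS1 and one with STS2, which suggests that STS1 lies closer to it than STS2 does.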

As described in Example 53 below, extended cDNAs (or genomic DNAs
obtainable therefrom) may also be used to identify genes associated with a particular
phenotype, such as hereditary disease or drug response.

EXAMPLE 53 

Identification of genes associated with hereditary diseases or drug response 
This example illustrates an approach useful for the association of extended cDNAs
(or genomic DNAs obtainable therefrom) with particular phenotypic characteristics. In
this example, a particular extended cDNA (or genomic DNA obtainable therefrom) is used
as a test probe to associate that extended cDNA (or genomic DNA obtainable therefrom)
with a particular phenotypic characteristic.

Extended cDNAs (or genomic DNAs obtainable therefrom) are mapped to a
particular location on a human chromosome using techniques such as those described in
Examples 49 and 50 or other techniques known in the art. A search of Mendelian
Inheritance in Man (V. McKusick, Mendelian Inheritance in Man, available on line
through Johns Hopkins University Welch Medical Library) reveals the region of the
human chromosome which contains the extended cDNA (or genomic DNA obtainable
therefrom) to be a very gene rich region containing several known genes and several
diseases or phenotypes for which genes have not been identified. The gene corresponding
to this extended cDNA (or genomic DNA obtainable therefrom) thus becomes an
immediate candidate for each of these genetic diseases.

Cells from patients with these diseases or phenotypes are isolated and expanded in
culture. PCR primers from the extended cDNA (or genomic DNA obtainable therefrom)
are used to screen genomic DNA, mRNA or cDNA obtained from the patients. Extended
cDNAs (or genomic DNAs obtainable therefrom) that are not amplified in the patients can
be positively associated with a particular disease by further analysis. Alternatively, the
PCR analysis may yield fragments of different lengths when the samples are derived from
an individual having the phenotype associated with the disease than when the sample is
derived from a healthy individual, indicating that the gene containing the extended cDNA
may be responsible for the genetic disease.

VI. Use of Extended cDNAs (or genomic DNAs obtainable therefrom) to Construct
Vectors

The present extended cDNAs (or genomic DNAs obtainable therefrom) may also 
be used to construct secretion vectors capable of directing the secretion of the proteins 
encoded by genes inserted in the vectors. Such secretion vectors may facilitate the 
purification or enrichment of the proteins encoded by genes inserted therein by reducing
the number of background proteins from which the desired protein must be purified or
enriched. Exemplary secretion vectors are described in Example 54 below.

EXAMPLE 54 

Construction of Secretion Vectors

The secretion vectors of the present invention include a promoter capable of 
directing gene expression in the host cell, tissue, or organism of interest. Such promoters 
include the Rous Sarcoma Virus promoter, the SV40 promoter, the human 
cytomegalovirus promoter, and other promoters familiar to those skilled in the art. 

A signal sequence from an extended cDNA (or genomic DNA obtainable
therefrom), such as one of the signal sequences in SEQ ID NOs: 40-140 and 242-377 as
defined in Table IV above, is operably linked to the promoter such that the mRNA
transcribed from the promoter will direct the translation of the signal peptide. The host
cell, tissue, or organism may be any cell, tissue, or organism which recognizes the signal
peptide encoded by the signal sequence in the extended cDNA (or genomic DNA
obtainable therefrom). Suitable hosts include mammalian cells, tissues or organisms,
avian cells, tissues, or organisms, insect cells, tissues or organisms, or yeast.

In addition, the secretion vector contains cloning sites for inserting genes encoding 
the proteins which are to be secreted. The cloning sites facilitate the cloning of the insert
gene in frame with the signal sequence such that a fusion protein in which the signal
peptide is fused to the protein encoded by the inserted gene is expressed from the mRNA 
transcribed from the promoter. The signal peptide directs the extracellular secretion of the 
fusion protein. 
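
The requirement that the insert be cloned in frame with the signal sequence can be checked computationally before a construct is made. The sketch below is an illustration only: it assumes the signal sequence is supplied as a coding sequence beginning at its ATG, uses the standard genetic code, and rejects fusions in which the signal segment is not a whole number of codons or in which a stop codon appears before the end of the insert. The sequences and function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate complete codons of a DNA coding sequence ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def fuse_in_frame(signal_cds, insert_cds):
    """Return the fusion protein, or raise ValueError if the fusion is unusable."""
    signal_cds, insert_cds = signal_cds.upper(), insert_cds.upper()
    if not signal_cds.startswith("ATG"):
        raise ValueError("signal sequence must start with ATG")
    if len(signal_cds) % 3 != 0:
        raise ValueError("signal sequence would shift the insert out of frame")
    protein = translate(signal_cds + insert_cds)
    if "*" in protein[:-1]:
        raise ValueError("stop codon before the end of the insert")
    return protein.rstrip("*")


fusion = fuse_in_frame("ATGAAACTG", "GCTGATTAA")
```

A signal segment that is one nucleotide short, such as ATGAAACT, is rejected because every downstream codon of the insert would be read in the wrong frame.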

The secretion vector may be DNA or RNA and may integrate into the
chromosome of the host, be stably maintained as an extrachromosomal replicon in the
host, be an artificial chromosome, or be transiently present in the host. Many nucleic acid
backbones suitable for use as secretion vectors are known to those skilled in the art,
including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast
integrating plasmids, yeast episomal plasmids, yeast artificial chromosomes, human
artificial chromosomes, P element vectors, baculovirus vectors, or bacterial plasmids
capable of being transiently introduced into the host.

The secretion vector may also contain a polyA signal such that the polyA signal is
located downstream of the gene inserted into the secretion vector.

After the gene encoding the protein for which secretion is desired is inserted into
the secretion vector, the secretion vector is introduced into the host cell, tissue, or
organism using calcium phosphate precipitation, DEAE-Dextran, electroporation,
liposome-mediated transfection, viral particles or as naked DNA. The protein encoded by
the inserted gene is then purified or enriched from the supernatant using conventional
techniques such as ammonium sulfate precipitation, immunoprecipitation,
immunochromatography, size exclusion chromatography, ion exchange chromatography,
and HPLC. Alternatively, the secreted protein may be in a sufficiently enriched or pure state
in the supernatant or growth media of the host to permit it to be used for its intended
purpose without further enrichment.

The signal sequences may also be inserted into vectors designed for gene therapy.
In such vectors, the signal sequence is operably linked to a promoter such that mRNA
transcribed from the promoter encodes the signal peptide. A cloning site is located
downstream of the signal sequence such that a gene encoding a protein whose secretion is
desired may readily be inserted into the vector and fused to the signal sequence. The
vector is introduced into an appropriate host cell. The protein expressed from the
promoter is secreted extracellularly, thereby producing a therapeutic effect.

The extended cDNAs or 5' ESTs may also be used to clone sequences located 
upstream of the extended cDNAs or 5' ESTs which are capable of regulating gene 
expression, including promoter sequences, enhancer sequences, and other upstream
sequences which influence transcription or translation levels. Once identified and cloned,
these upstream regulatory sequences may be used in expression vectors designed to direct 
the expression of an inserted gene in a desired spatial, temporal, developmental, or 
quantitative fashion. Example 55 describes a method for cloning sequences upstream of 
the extended cDNAs or 5' ESTs. 



EXAMPLE 55 

Use of Extended cDNAs or 5' ESTs to Clone Upstream 
Sequences from Genomic DNA 
Sequences derived from extended cDNAs or 5' ESTs may be used to isolate the
promoters of the corresponding genes using chromosome walking techniques. In one
chromosome walking technique, which utilizes the GenomeWalker™ kit available from
Clontech, five complete genomic DNA samples are each digested with a different
restriction enzyme which has a 6 base recognition site and leaves a blunt end. Following
digestion, oligonucleotide adapters are ligated to each end of the resulting genomic DNA
fragments.

For each of the five genomic DNA libraries, a first PCR reaction is performed
according to the manufacturer's instructions (which are incorporated herein by reference)
using an outer adaptor primer provided in the kit and an outer gene specific primer. The
gene specific primer should be selected to be specific for the extended cDNA or 5' EST of
interest and should have a melting temperature, length, and location in the extended
cDNA or 5' EST which is consistent with its use in PCR reactions. Each first PCR reaction
contains 5 ng of genomic DNA, 5 µl of 10X Tth reaction buffer, 0.2 mM of each dNTP,
0.2 µM each of outer adaptor primer and outer gene specific primer, 1.1 mM of
Mg(OAc)2, and 1 µl of the Tth polymerase 50X mix in a total volume of 50 µl. The
reaction cycle for the first PCR reaction is as follows: 1 min @ 94°C / 2 sec @ 94°C, 3
min @ 72°C (7 cycles) / 2 sec @ 94°C, 3 min @ 67°C (32 cycles) / 5 min @ 67°C.

The product of the first PCR reaction is diluted and used as a template for a second
PCR reaction according to the manufacturer's instructions using a pair of nested primers
which are located internally on the amplicon resulting from the first PCR reaction. For
example, 5 µl of the reaction product of the first PCR reaction mixture may be diluted 180
times. Reactions are made in a 50 µl volume having a composition identical to that of the
first PCR reaction except the nested primers are used. The first nested primer is specific
for the adaptor, and is provided with the GenomeWalker™ kit. The second nested primer
is specific for the particular extended cDNA or 5' EST for which the promoter is to be
cloned and should have a melting temperature, length, and location in the extended cDNA
or 5' EST which is consistent with its use in PCR reactions. The reaction parameters of
the second PCR reaction are as follows: 1 min @ 94°C / 2 sec @ 94°C, 3 min @ 72°C (6
cycles) / 2 sec @ 94°C, 3 min @ 67°C (25 cycles) / 5 min @ 67°C.
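
The reaction set-up arithmetic can be sketched as follows. The final concentrations and the 180-fold dilution are taken from the text; the stock concentrations (10 mM dNTP, 10 µM primer, 25 mM magnesium acetate) are assumptions chosen only to make the example concrete and are not values stated for the kit. Volumes follow from C1 x V1 = C2 x V2.

```python
def stock_volume_ul(final_conc, stock_conc, total_volume_ul):
    """Volume of stock (µl) giving final_conc in total_volume_ul.

    final_conc and stock_conc must be expressed in the same unit.
    """
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be more concentrated than the final mix")
    return final_conc * total_volume_ul / stock_conc


def dilution_volumes_ul(sample_ul, fold):
    """Return (total volume, diluent volume) for a fold dilution of sample_ul."""
    total = sample_ul * fold
    return total, total - sample_ul


TOTAL_UL = 50.0  # reaction volume stated in the text

# Stock concentrations below are hypothetical.
mix = {
    "dNTP": stock_volume_ul(0.2, 10.0, TOTAL_UL),      # 0.2 mM from 10 mM
    "primer": stock_volume_ul(0.2, 10.0, TOTAL_UL),    # 0.2 µM from 10 µM
    "Mg(OAc)2": stock_volume_ul(1.1, 25.0, TOTAL_UL),  # 1.1 mM from 25 mM
}

# 5 µl of first-round product diluted 180 times before the nested PCR.
total_ul, diluent_ul = dilution_volumes_ul(5.0, 180)
```

Under these assumed stocks each reaction takes about 1 µl of dNTP mix, 1 µl of each primer and 2.2 µl of magnesium acetate, and the 180-fold dilution corresponds to adding 5 µl of product to 895 µl of diluent.
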
The product of the second PCR reaction is purified, cloned, and sequenced using
standard techniques. Alternatively, two or more human genomic DNA libraries can be
constructed by using two or more restriction enzymes. The digested genomic DNA is
cloned into vectors which can be converted into single stranded, circular, or linear DNA.
A biotinylated oligonucleotide comprising at least 15 nucleotides from the extended
cDNA or 5' EST sequence is hybridized to the single stranded DNA. Hybrids between the
biotinylated oligonucleotide and the single stranded DNA containing the extended cDNA
or EST sequence are isolated as described in Example 29 above. Thereafter, the single
stranded DNA containing the extended cDNA or EST sequence is released from the beads
and converted into double stranded DNA using a primer specific for the extended cDNA
or 5' EST sequence or a primer corresponding to a sequence included in the cloning
vector. The resulting double stranded DNA is transformed into bacteria. DNAs
containing the 5' EST or extended cDNA sequences are identified by colony PCR or
colony hybridization.

Once the upstream genomic sequences have been cloned and sequenced as
described above, prospective promoters and transcription start sites within the upstream
sequences may be identified by comparing the sequences upstream of the extended 
cDNAs or 5' ESTs with databases containing known transcription start sites, transcription 
factor binding sites, or promoter sequences. 

In addition, promoters in the upstream sequences may be identified using promoter
reporter vectors as described in Example 56.

EXAMPLE 56 
Identification of Promoters in Cloned Upstream Sequences
The genomic sequences upstream of the extended cDNAs or 5' ESTs are cloned
into a suitable promoter reporter vector, such as the pSEAP-Basic, pSEAP-Enhancer,
pβgal-Basic, pβgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from
Clontech. Briefly, each of these promoter reporter vectors includes multiple cloning sites
positioned upstream of a reporter gene encoding a readily assayable protein such as
secreted alkaline phosphatase, β galactosidase, or green fluorescent protein. The
sequences upstream of the extended cDNAs or 5' ESTs are inserted into the cloning sites
upstream of the reporter gene in both orientations and introduced into an appropriate host
cell. The level of reporter protein is assayed and compared to the level obtained from a
vector which lacks an insert in the cloning site. The presence of an elevated expression
level in the vector containing the insert with respect to the control vector indicates the
presence of a promoter in the insert. If necessary, the upstream sequences can be cloned
into vectors which contain an enhancer for augmenting transcription levels from weak
promoter sequences. A significant level of expression above that observed with the vector
lacking an insert indicates that a promoter sequence is present in the inserted upstream
sequence.
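
The comparison with the insert-less control vector can be expressed as a fold change. The sketch below assumes replicate reporter readings in arbitrary units for each construct and calls promoter activity when the mean reading exceeds the control mean by a chosen factor. The threshold of 3-fold and the readings are hypothetical; the text itself specifies only that expression should be significantly above the control.

```python
from statistics import mean


def fold_over_control(insert_readings, control_readings):
    """Mean reporter signal of a construct relative to the empty vector."""
    control = mean(control_readings)
    if control <= 0:
        raise ValueError("control signal must be positive")
    return mean(insert_readings) / control


def call_promoters(readings_by_construct, control_readings, min_fold=3.0):
    """Return {construct: (fold change, True if fold change >= min_fold)}."""
    calls = {}
    for name, readings in readings_by_construct.items():
        fold = fold_over_control(readings, control_readings)
        calls[name] = (fold, fold >= min_fold)
    return calls


constructs = {
    "insert_forward": [900, 1100, 1000],  # insert in one orientation
    "insert_reverse": [110, 90, 100],     # same insert, opposite orientation
}
calls = call_promoters(constructs, control_readings=[100, 100, 100])
```

With these invented readings only the forward orientation shows activity, which is the pattern expected for a directional promoter. In practice a statistical test on the replicates would replace the fixed threshold.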

Appropriate host cells for the promoter reporter vectors may be chosen based on
the results of the above described determination of expression patterns of the extended
cDNAs and ESTs. For example, if the expression pattern analysis indicates that the
mRNA corresponding to a particular extended cDNA or 5' EST is expressed in
fibroblasts, the promoter reporter vector may be introduced into a human fibroblast cell
line.

Promoter sequences within the upstream genomic DNA may be further defined by 
constructing nested deletions in the upstream DNA using conventional techniques such as 
Exonuclease III digestion. The resulting deletion fragments can be inserted into the 
promoter reporter vector to determine whether the deletion has reduced or obliterated
promoter activity. In this way, the boundaries of the promoters may be defined. If
desired, potential individual regulatory sites within the promoter may be identified using
site directed mutagenesis or linker scanning to obliterate potential transcription factor
binding sites within the promoter individually or in combination. The effects of these
mutations on transcription levels may be determined by inserting the mutations into the
cloning sites in the promoter reporter vectors.

EXAMPLE 57
Cloning and Identification of Promoters
Using the method described in Example 55 above with 5' ESTs, sequences
upstream of several genes were obtained. Using the primer pairs GGG AAG ATG GAG
ATA GTA TTG CCT G (SEQ ID NO:29) and CTG CCA TGT ACA TGA TAG AGA
GAT TC (SEQ ID NO:30), the promoter having the internal designation P13H2 (SEQ ID
NO:31) was obtained.

Using the primer pairs GTA CCA GGGG ACT GTG ACC ATT GC (SEQ ID
NO:32) and CTG TGA CCA TTG CTC CCA AGA GAG (SEQ ID NO:33), the promoter
having the internal designation P15B4 (SEQ ID NO:34) was obtained.

Using the primer pairs CTG GGA TGG AAG GCA CGG TA (SEQ ID NO:35)
and GAG ACC ACA CAG CTA GAC AA (SEQ ID NO:36), the promoter having the
internal designation P29B6 (SEQ ID NO:37) was obtained.

Figure 8 provides a schematic description of the promoters isolated and the way
they are assembled with the corresponding 5' tags. The upstream sequences were
screened for the presence of motifs resembling transcription factor binding sites or known
transcription start sites using the computer program MatInspector release 2.0, August
1996.
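
A motif screen of this general kind can be illustrated with a simple weight matrix scan. The sketch below is not the MatInspector algorithm and does not reproduce its scores: it assumes a matrix given as one dictionary of base weights per position, scores every window on both strands as the sum of matched weights divided by the best possible sum, and reports hits with position, orientation, score, length and sequence. Positions are counted from 1 at the start of the supplied sequence, which is an assumption of this example; the TATA-like matrix and the test sequence are invented.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def scan(sequence, matrix, threshold=0.85):
    """Return a list of hits scoring at least `threshold` on either strand."""
    sequence = sequence.upper()
    width = len(matrix)
    best = sum(max(column.values()) for column in matrix)
    hits = []
    for strand, seq in (("+", sequence), ("-", revcomp(sequence))):
        for i in range(len(seq) - width + 1):
            site = seq[i:i + width]
            raw = sum(col.get(base, 0.0) for col, base in zip(matrix, site))
            score = raw / best
            if score < threshold:
                continue
            # Leftmost coordinate of the site on the + strand, 1-based.
            pos = i + 1 if strand == "+" else len(seq) - i - width + 1
            hits.append({
                "position": pos,
                "orientation": strand,
                "score": round(score, 3),
                "length": width,
                "sequence": site,
            })
    return hits


# Hypothetical consensus-only matrix for TATAAA (weight 1 per matching base).
tata = [{"T": 1.0}, {"A": 1.0}, {"T": 1.0}, {"A": 1.0}, {"A": 1.0}, {"A": 1.0}]
hits = scan("GGCTATAAAGGC", tata)
```

Real matrices carry graded weights for all four bases at each position, so partial matches receive intermediate scores rather than the all-or-nothing behaviour of this toy matrix.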

Figure 9 describes the transcription factor binding sites present in each of these
promoters. The column labeled "matrice" provides the name of the MatInspector matrix
used. The column labeled "position" provides the 5' position of the promoter site.
Numeration of the sequence starts from the transcription site as determined by matching
the genomic sequence with the 5' EST sequence. The column labeled "orientation"
indicates the DNA strand on which the site is found, with the + strand being the coding
strand as determined by matching the genomic sequence with the sequence of the 5' EST.
The column labeled "score" provides the MatInspector score found for this site. The
column labeled "length" provides the length of the site in nucleotides. The column labeled
"sequence" provides the sequence of the site found.

The promoters and other regulatory sequences located upstream of the extended 
cDNAs or 5' ESTs may be used to design expression vectors capable of directing the 
expression of an inserted gene in a desired spatial, temporal, developmental, or 
quantitative manner. A promoter capable of directing the desired spatial, temporal, 
developmental, and quantitative patterns may be selected using the results of the 
expression analysis described in Example 26 above. For example, if a promoter which 
confers a high level of expression in muscle is desired, the promoter sequence upstream of 
an extended cDNA or 5' EST derived from an mRNA which is expressed at a high level in 
muscle, as determined by the method of Example 26, may be used in the expression 
vector. 

Preferably, the desired promoter is placed near multiple restriction sites to 
facilitate the cloning of the desired insert downstream of the promoter, such that the 
promoter is able to drive expression of the inserted gene. The promoter may be inserted in 
conventional nucleic acid backbones designed for extrachromosomal replication,
integration into the host chromosomes or transient expression. Suitable backbones for the 
present expression vectors include retroviral backbones, backbones from eukaryotic 
episomes such as SV40 or Bovine Papilloma Virus, backbones from bacterial episomes, 
or artificial chromosomes. 

Preferably, the expression vectors also include a polyA signal downstream of the 
multiple restriction sites for directing the polyadenylation of mRNA transcribed from the
gene inserted into the expression vector. 

Following the identification of promoter sequences using the procedures of 
Examples 55-57, proteins which interact with the promoter may be identified as described 
in Example 58 below. 

EXAMPLE 58 

Identification of Proteins Which Interact with Promoter Sequences, Upstream
Regulatory Sequences, or mRNA
Sequences within the promoter region which are likely to bind transcription factors
may be identified by homology to known transcription factor binding sites or through
conventional mutagenesis or deletion analyses of reporter plasmids containing the
promoter sequence. For example, deletions may be made in a reporter plasmid containing
the promoter sequence of interest operably linked to an assayable reporter gene. The
reporter plasmids carrying various deletions within the promoter region are transfected
into an appropriate host cell and the effects of the deletions on expression levels are
assessed. Transcription factor binding sites within the regions in which deletions reduce
expression levels may be further localized using site directed mutagenesis, linker scanning
analysis, or other techniques familiar to those skilled in the art. Nucleic acids encoding
proteins which interact with sequences in the promoter may be identified using one-hybrid
systems such as those described in the manual accompanying the Matchmaker One-Hybrid
System kit available from Clontech (Catalog No. K1603-1), the disclosure of
which is incorporated herein by reference. Briefly, the Matchmaker One-hybrid system is
used as follows. The target sequence for which it is desired to identify binding proteins is
cloned upstream of a selectable reporter gene and integrated into the yeast genome.
Preferably, multiple copies of the target sequences are inserted into the reporter plasmid in
tandem.

A library comprised of fusions between cDNAs to be evaluated for the ability to
bind to the promoter and the activation domain of a yeast transcription factor, such as
GAL4, is transformed into the yeast strain containing the integrated reporter sequence.
The yeast are plated on selective media to select cells expressing the selectable marker
linked to the promoter sequence. The colonies which grow on the selective media contain
genes encoding proteins which bind the target sequence. The inserts in the genes
encoding the fusion proteins are further characterized by sequencing. In addition, the
inserts may be inserted into expression vectors or in vitro transcription vectors. Binding 
of the polypeptides encoded by the inserts to the promoter DNA may be confirmed by 
techniques familiar to those skilled in the art, such as gel shift analysis or DNAse 
protection analysis. 

VII. Use of Extended cDNAs (or Genomic DNAs Obtainable Therefrom) in Gene
Therapy

The present invention also comprises the use of extended cDNAs (or genomic
DNAs obtainable therefrom) in gene therapy strategies, including antisense and triple
helix strategies as described in Examples 57 and 58 below. In antisense approaches,
nucleic acid sequences complementary to an mRNA are hybridized to the mRNA
intracellularly, thereby blocking the expression of the protein encoded by the mRNA. The
antisense sequences may prevent gene expression through a variety of mechanisms. For
example, the antisense sequences may inhibit the ability of ribosomes to translate the
mRNA. Alternatively, the antisense sequences may block transport of the mRNA from
the nucleus to the cytoplasm, thereby limiting the amount of mRNA available for
translation. Another mechanism through which antisense sequences may inhibit gene
expression is by interfering with mRNA splicing. In yet another strategy, the antisense
nucleic acid may be incorporated in a ribozyme capable of specifically cleaving the target
mRNA.

EXAMPLE 59 

Preparation and Use of Antisense Oligonucleotides 
The antisense nucleic acid molecules to be used in gene therapy may be either
DNA or RNA sequences. They may comprise a sequence complementary to the sequence
of the extended cDNA (or genomic DNA obtainable therefrom). The antisense nucleic
acids should have a length and melting temperature sufficient to permit formation of an
intracellular duplex having sufficient stability to inhibit the expression of the mRNA in
the duplex. Strategies for designing antisense nucleic acids suitable for use in gene
therapy are disclosed in Green et al., Ann. Rev. Biochem. 55:569-597 (1986) and Izant
and Weintraub, Cell 36:1007-1015 (1984), which are hereby incorporated by reference.
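
The relationship between a target mRNA segment and its antisense oligonucleotide, together with a rough melting temperature check, can be sketched as follows. The example derives a DNA antisense oligonucleotide as the reverse complement of the target and estimates its melting temperature with the Wallace rule, Tm = 2(A+T) + 4(G+C) in °C. That rule is a crude approximation for short oligonucleotides and is used here only as an assumption for illustration; the cited references should be consulted for actual design criteria. The target sequence is invented.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(mrna_target):
    """DNA oligonucleotide (5' to 3') complementary to an mRNA segment."""
    dna = mrna_target.upper().replace("U", "T")
    if set(dna) - set("ACGT"):
        raise ValueError("target may contain only A, C, G, U or T")
    return dna.translate(COMPLEMENT)[::-1]


def wallace_tm(oligo):
    """Approximate melting temperature in °C by the Wallace rule."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


oligo = antisense_oligo("AUGGCUAAG")
tm = wallace_tm(oligo)
```

A 9-mer like this one is far too short for use; the same functions can be applied to longer target windows to compare candidate oligonucleotides by length and estimated duplex stability.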

In some strategies, antisense molecules are obtained from a nucleotide sequence 
encoding a protein by reversing the orientation of the coding region with respect to a 
promoter so as to transcribe the opposite strand from that which is normally transcribed in 
the cell. The antisense molecules may be transcribed using in vitro transcription systems 
such as those which employ T7 or SP6 polymerase to generate the transcript. Another 
approach involves transcription of the antisense nucleic acids in vivo by operably linking 
DNA containing the antisense sequence to a promoter in an expression vector. 

Alternatively, oligonucleotides which are complementary to the strand normally 
transcribed in the cell may be synthesized in vitro. Thus, the antisense nucleic acids are 
complementary to the corresponding mRNA and are capable of hybridizing to the mRNA 
to create a duplex. In some embodiments, the antisense sequences may contain modified 
sugar phosphate backbones to increase stability and make them less sensitive to RNase 
activity. Examples of modifications suitable for use in antisense strategies are described 
by Rossi et al., Pharmacol. Ther. 50(2):245-254 (1991). 

Various types of antisense oligonucleotides complementary to the sequence of the 
extended cDNA (or genomic DNA obtainable therefrom) may be used. In one preferred 
embodiment, stable and semi-stable antisense oligonucleotides described in International 
Application No. PCT WO94/23026, hereby incorporated by reference, are used. In these 
molecules, the 3' end or both the 3' and 5' ends are engaged in intramolecular hydrogen 
bonding between complementary base pairs. These molecules are better able to withstand 
exonuclease attacks and exhibit increased stability compared to conventional antisense 
oligonucleotides. 

In another preferred embodiment, the antisense oligodeoxynucleotides against 
herpes simplex virus types 1 and 2 described in International Application No. WO 
95/04141, hereby incorporated by reference, are used. 

In yet another preferred embodiment, the covalently cross-linked antisense 
oligonucleotides described in International Application No. WO 96/31523, hereby 
incorporated by reference, are used. These double- or single-stranded oligonucleotides 
comprise one or more, respectively, inter- or intra-oligonucleotide covalent cross-linkages, 
wherein the linkage consists of an amide bond between a primary amine group of one 
strand and a carboxyl group of the other strand or of the same strand, respectively, the 
primary amine group being directly substituted in the 2' position of the strand nucleotide 
monosaccharide ring, and the carboxyl group being carried by an aliphatic spacer group 
substituted on a nucleotide or nucleotide analog of the other strand or the same strand, 
respectively. 

The antisense oligodeoxynucleotides and oligonucleotides disclosed in 
International Application No. WO 92/18522, incorporated by reference, may also be used. 
These molecules are stable to degradation and contain at least one transcription control 
recognition sequence which binds to control proteins and are effective as decoys therefor. 
These molecules may contain "hairpin" structures, "dumbbell" structures, "modified 
dumbbell" structures, "cross-linked" decoy structures and "loop" structures. 

In another preferred embodiment, the cyclic double-stranded oligonucleotides 
described in European Patent Application No. 0 572 287 A2, hereby incorporated by 
reference, are used. These ligated oligonucleotide "dumbbells" contain the binding site for 
a transcription factor and inhibit expression of the gene under control of the transcription 
factor by sequestering the factor. 

Use of the closed antisense oligonucleotides disclosed in International Application 
No. WO 92/19732, hereby incorporated by reference, is also contemplated. Because these 
molecules have no free ends, they are more resistant to degradation by exonucleases than 
are conventional oligonucleotides. These oligonucleotides may be multifunctional, 
interacting with several regions which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression 
may be determined using in vitro expression analysis. The antisense molecule may be 
introduced into the cells by diffusion, injection, infection or transfection using procedures 
known in the art. For example, the antisense nucleic acids can be introduced into the body 
as a bare or naked oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide 
sequence encapsidated by viral protein, or as an oligonucleotide operably linked to a 
promoter contained in an expression vector. The expression vector may be any of a 
variety of expression vectors known in the art, including retroviral or viral vectors, vectors 
capable of extrachromosomal replication, or integrating vectors. The vectors may be 
DNA or RNA. 

The antisense molecules are introduced onto cell samples at a number of different 
concentrations, preferably between 1x10^-10 M and 1x10^-4 M. Once the minimum 
concentration that can adequately control gene expression is identified, the optimized dose 
is translated into a dosage suitable for use in vivo. For example, an inhibiting 
concentration in culture of 1x10^-7 M translates into a dose of approximately 0.6 mg/kg 
bodyweight. Levels of oligonucleotide approaching 100 mg/kg bodyweight or higher may 
be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is 
additionally contemplated that cells from the vertebrate are removed, treated with the 
antisense oligonucleotide, and reintroduced into the vertebrate. 
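
The translation of a culture concentration into a body-weight dose is simple arithmetic, sketched below. The sketch assumes an oligonucleotide of about 6,000 g/mol (roughly a 20-mer) and a distribution volume of about 1 litre per kilogram; neither figure is stated in the text, and the function name is hypothetical. Under those assumptions, an inhibiting concentration read as 1x10^-7 M comes out close to the 0.6 mg/kg figure quoted above.

```python
def dose_mg_per_kg(conc_molar: float,
                   mol_weight_g_per_mol: float,
                   dist_volume_l_per_kg: float = 1.0) -> float:
    """Convert a molar concentration into an approximate mg/kg dose.

    mol_weight_g_per_mol and dist_volume_l_per_kg are assumptions supplied
    by the caller; they are not values given in the specification.
    """
    mg_per_litre = conc_molar * mol_weight_g_per_mol * 1000.0  # g/L -> mg/L
    return mg_per_litre * dist_volume_l_per_kg


if __name__ == "__main__":
    # Assumed 20-mer of ~6,000 g/mol at an inhibiting concentration of 1e-7 M.
    print(round(dose_mg_per_kg(1e-7, 6000.0), 3))
```

The in vivo dose actually used would still depend on the toxicity testing described in the text.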

It is further contemplated that the antisense oligonucleotide sequence is 
incorporated into a ribozyme sequence to enable the antisense to specifically bind and 
cleave its target mRNA. For technical applications of ribozyme and antisense 
oligonucleotides see Rossi et al., supra. 

In a preferred application of this invention, the polypeptide encoded by the gene is 
first identified, so that the effectiveness of antisense inhibition on translation can be 
monitored using techniques that include but are not limited to antibody-mediated tests 
such as RIAs and ELISA, functional assays, or radiolabeling. 

The extended cDNAs of the present invention (or genomic DNAs obtainable 
therefrom) may also be used in gene therapy approaches based on intracellular triple helix 
formation. Triple helix oligonucleotides are used to inhibit transcription from a genome. 
They are particularly useful for studying alterations in cell activity as it is associated with 
a particular gene. The extended cDNAs (or genomic DNAs obtainable therefrom) of the 
present invention or, more preferably, a portion of those sequences, can be used to inhibit 
gene expression in individuals having diseases associated with expression of a particular 
gene. Similarly, a portion of the extended cDNA (or genomic DNA obtainable therefrom) 
can be used to study the effect of inhibiting transcription of a particular gene within a cell. 
Traditionally, homopurine sequences were considered the most useful for triple helix 
strategies. However, homopyrimidine sequences can also inhibit gene expression. Such 
homopyrimidine oligonucleotides bind to the major groove at 
homopurine:homopyrimidine sequences. Thus, both types of sequences from the extended 
cDNA or from the gene corresponding to the extended cDNA are contemplated within the 
scope of this invention. 

EXAMPLE 60 

Preparation and Use of Triple Helix Probes 

The sequences of the extended cDNAs (or genomic DNAs obtainable therefrom) 
are scanned to identify 10-mer to 20-mer homopyrimidine or homopurine stretches which 
could be used in triple-helix based strategies for inhibiting gene expression. Following 
identification of candidate homopyrimidine or homopurine stretches, their efficiency in 
inhibiting gene expression is assessed by introducing varying amounts of oligonucleotides 
containing the candidate sequences into tissue culture cells which normally express the 
target gene. The oligonucleotides may be prepared on an oligonucleotide synthesizer or 
they may be purchased commercially from a company specializing in custom 
oligonucleotide synthesis, such as GENSET, Paris, France. 
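
The scanning step described above can be sketched as follows. This is a minimal illustration, assuming the sequence is supplied as a plain DNA string; it reports maximal purine-only (A/G) or pyrimidine-only (C/T) runs of at least 10 bases, from which candidate 10-mers to 20-mers would then be chosen. The function name and the example sequence are hypothetical.

```python
import re


def find_homo_stretches(seq: str, min_len: int = 10):
    """Return maximal homopurine / homopyrimidine runs of at least min_len bases.

    Each hit is (start, end, kind, run), with a 0-based start and exclusive end.
    """
    s = seq.upper()
    hits = []
    for kind, pattern in (("homopurine", r"[AG]+"), ("homopyrimidine", r"[CT]+")):
        for match in re.finditer(pattern, s):
            if match.end() - match.start() >= min_len:
                hits.append((match.start(), match.end(), kind, match.group()))
    return sorted(hits)


if __name__ == "__main__":
    # Illustrative sequence only: a 12-base purine run flanked by short T runs.
    for hit in find_homo_stretches("TTT" + "AGGAGAAGGAGA" + "TTT"):
        print(hit)
```

Runs longer than 20 bases would simply offer several overlapping candidate windows.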

The oligonucleotides may be introduced into the cells using a variety of methods 
known to those skilled in the art, including but not limited to calcium phosphate 
precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection or native 
uptake. 

Treated cells are monitored for altered cell function or reduced gene expression 
using techniques such as Northern blotting, RNase protection assays, or PCR based 
strategies to monitor the transcription levels of the target gene in cells which have been 
treated with the oligonucleotide. The cell functions to be monitored are predicted based 
upon the homologies of the target gene corresponding to the extended cDNA from which 
the oligonucleotide was derived with known gene sequences that have been associated 
with a particular function. The cell functions can also be predicted based on the presence 
of abnormal physiologies within cells derived from individuals with a particular inherited 
disease, particularly when the extended cDNA is associated with the disease using 
techniques described in Example 53. 

The oligonucleotides which are effective in inhibiting gene expression in tissue 
culture cells may then be introduced in vivo using the techniques described above and in 
Example 59 at a dosage calculated based on the in vitro results, as described in Example 
59. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can 
be replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. 
Further, an intercalating agent such as ethidium bromide, or the like, can be attached to the 
3' end of the alpha oligonucleotide to stabilize the triple helix. For information on the 
generation of oligonucleotides suitable for triple helix formation see Griffin et al. (Science 
245:967-971 (1989), which is hereby incorporated by this reference). 

EXAMPLE 61 

Use of Extended cDNAs to Express an Encoded Protein in a Host Organism 
The extended cDNAs of the present invention may also be used to express an 
encoded protein in a host organism to produce a beneficial effect. In such procedures, the 
encoded protein may be transiently expressed in the host organism or stably expressed in 
the host organism. The encoded protein may have any of the activities described above. 
The encoded protein may be a protein which the host organism lacks or, alternatively, the 
encoded protein may augment the existing levels of the protein in the host organism. 

A full length extended cDNA encoding the signal peptide and the mature protein, 
or an extended cDNA encoding only the mature protein, is introduced into the host 
organism. The extended cDNA may be introduced into the host organism using a variety 
of techniques known to those of skill in the art. For example, the extended cDNA may be 
injected into the host organism as naked DNA such that the encoded protein is expressed 
in the host organism, thereby producing a beneficial effect. 

Alternatively, the extended cDNA may be cloned into an expression vector 
downstream of a promoter which is active in the host organism. The expression vector 
may be any of the expression vectors designed for use in gene therapy, including viral or 
retroviral vectors. 

The expression vector may be directly introduced into the host organism such that 
the encoded protein is expressed in the host organism to produce a beneficial effect. In 
another approach, the expression vector may be introduced into cells in vitro. Cells 
containing the expression vector are thereafter selected and introduced into the host 
organism, where they express the encoded protein to produce a beneficial effect. 

EXAMPLE 62 
Use of Signal Peptides Encoded by 5' ESTs or Sequences 
Obtained Therefrom to Import Proteins into Cells 
The short core hydrophobic region (h) of signal peptides encoded by the 5' ESTs 
or extended cDNAs derived from the 5' ESTs of the present invention may also be used as 
a carrier to import a peptide or a protein of interest, so-called cargo, into tissue culture 
cells (Lin et al., J. Biol. Chem., 270: 14225-14258 (1995); Du et al., J. Peptide Res., 51: 
235-243 (1998); Rojas et al., Nature Biotech., 16: 370-375 (1998)). 

When cell permeable peptides of limited size (approximately up to 25 amino 
acids) are to be translocated across the cell membrane, chemical synthesis may be used in 
order to add the h region to either the C-terminus or the N-terminus of the cargo peptide of 
interest. Alternatively, when longer peptides or proteins are to be imported into cells, 
nucleic acids can be genetically engineered, using techniques familiar to those skilled in 
the art, in order to link the extended cDNA sequence encoding the h region to the 5' or the 
3' end of a DNA sequence coding for a cargo polypeptide. Such genetically engineered 
nucleic acids are then translated either in vitro or in vivo after transfection into appropriate 
cells, using conventional techniques to produce the resulting cell permeable polypeptide. 
Suitable host cells are then simply incubated with the cell permeable polypeptide, which 
is then translocated across the membrane. 

This method may be applied to study diverse intracellular functions and cellular 
processes. For instance, it has been used to probe functionally relevant domains of 
intracellular proteins and to examine protein-protein interactions involved in signal 
transduction pathways (Lin et al., supra; Lin et al., J. Biol. Chem., 271: 5305-5308 
(1996); Rojas et al., J. Biol. Chem., 271: 27456-27461 (1996); Liu et al., Proc. Natl. Acad. 
Sci. USA, 93: 11819-11824 (1996); Rojas et al., Biochem. Biophys. Res. Commun., 234: 675- 
680 (1997)). 

Such techniques may be used in cellular therapy to import proteins producing 
therapeutic effects. For instance, cells isolated from a patient may be treated with 
imported therapeutic proteins and then re-introduced into the host organism. 

Alternatively, the h region of signal peptides of the present invention could be 
used in combination with a nuclear localization signal to deliver nucleic acids into the cell 
nucleus. Such nucleic acids may be antisense oligonucleotides or oligonucleotides 
designed to form triple helices, as described in Examples 59 and 60 respectively, in order 
to inhibit processing and maturation of a target cellular RNA. 

EXAMPLE 63 
Reassembling & Resequencing of Clones 
Full length cDNA clones obtained by the procedure described in Example 27 
were double-sequenced. These sequences were assembled and the resulting consensus 
sequences were then reanalyzed. Open reading frames were reassigned following 
essentially the same process as the one described in Example 27. 

After this reanalysis process a few abnormalities were revealed. The sequences 
presented in SEQ ID NOs: 47, 73, 79, 89, 91, 96, 126, 128, 134, and 139 are 
unlikely to be genuine full length cDNAs. These clones are missing a stop codon and 
are thus more probably 3' truncated cDNA sequences. Similarly, the sequences 
presented in SEQ ID NOs: 45, 50, 54, 57, 73, 74, 89, 92, 95, 98, 126, 129, 130, 131 and 
139 may also not be genuine full length cDNAs based on homology studies with 
existing protein sequences. Although each of these sequences encodes a potential start 
methionine, each could represent a 5' truncated cDNA. 
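
The missing-stop-codon criterion can be illustrated with a short sketch. It assumes the cDNA is given as a DNA string and that the open reading frame begins at a known 0-based offset; the function name and the example sequences are hypothetical and are not the analysis software actually used.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_inframe_stop(cdna: str, orf_start: int) -> bool:
    """Return True if a stop codon occurs in frame downstream of orf_start.

    A False result flags the clone as a possible 3' truncated cDNA.
    """
    s = cdna.upper()
    for i in range(orf_start, len(s) - 2, 3):  # step codon by codon
        if s[i:i + 3] in STOP_CODONS:
            return True
    return False


if __name__ == "__main__":
    print(has_inframe_stop("ATGGCCTAA", 0))    # stop codon in frame
    print(has_inframe_stop("ATGGCCTTAAG", 0))  # TAA present, but out of frame
```

Only codons in the reading frame set by the start position are examined, so an out-of-frame TAA, TAG or TGA does not count as a stop.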

In addition, SEQ ID NO: 115 was found to be an alternatively spliced transcript 
and the identities of some of the bases in SEQ ID NO: 131 were corrected. 

Finally, after the reassignment of open reading frames for the clones, new open 
reading frames were chosen in some instances. For example, in the case of SEQ ID 
NOs: 41, 47, 50, 52, 54-56, 58, 59, 61, 74, 75, 79, 84, 89, 91, 92, 96, 98, 103, 105, 106, 
126, 129, 131, and 133 the new open reading frames were no longer predicted to contain 
a signal peptide. 

As discussed above, Table IV provides the sequence identification numbers of 
the extended cDNAs of the present invention, the locations of the full coding sequences 
in SEQ ID NOs: 40-140 and 242-377 (i.e. the nucleotides encoding both the signal 
peptide and the mature protein, listed under the heading FCS Location in Table IV), the 
locations of the nucleotides in SEQ ID NOs: 40-140 and 242-377 which encode the 
signal peptides (listed under the heading SigPep Location in Table IV), the locations of 
the nucleotides in SEQ ID NOs: 40-140 and 242-377 which encode the mature proteins 
generated by cleavage of the signal peptides (listed under the heading Mature 
Polypeptide Location in Table IV), the locations in SEQ ID NOs: 40-140 and 242-377 
of stop codons (listed under the heading Stop Codon Location in Table IV), the locations 
in SEQ ID NOs: 40-140 and 242-377 of polyA signals (listed under the heading 
PolyA Signal Location in Table IV) and the locations of polyA sites (listed under the 
heading PolyA Site Location in Table IV). 

As discussed above, Table V lists the sequence identification numbers of the 
polypeptides of SEQ ID NOs: 141-241 and 378-513, the locations of the amino acid 
residues of SEQ ID NOs: 141-241 and 378-513 in the full length polypeptide (second 
column), the locations of the amino acid residues of SEQ ID NOs: 141-241 and 378-513 
in the signal peptides (third column), and the locations of the amino acid residues of 
SEQ ID NOs: 141-241 and 378-513 in the mature polypeptide created by cleaving the 
signal peptide from the full length polypeptide (fourth column). In Table V, and in the 
appended sequence listing, the first amino acid of the mature protein resulting from 
cleavage of the signal peptide is designated as amino acid number 1 and the first amino 
acid of the signal peptide is designated with the appropriate negative number, in 
accordance with the regulations governing sequence listings. 
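
The numbering convention just described can be made concrete with a small sketch. It assumes a signal peptide of known length and, as is usual for sequence listings, that there is no residue numbered zero, so the last signal peptide residue is -1 and the first mature residue is 1. The function name is hypothetical.

```python
def to_listing_number(pos_from_met: int, signal_len: int) -> int:
    """Convert a 1-based position counted from the first methionine into
    the sequence-listing number (mature protein starts at 1, no zero)."""
    if pos_from_met < 1:
        raise ValueError("positions are 1-based")
    if pos_from_met <= signal_len:
        # Inside the signal peptide: first residue is -signal_len, last is -1.
        return pos_from_met - signal_len - 1
    return pos_from_met - signal_len


if __name__ == "__main__":
    # Illustrative 20-residue signal peptide.
    print(to_listing_number(1, 20), to_listing_number(20, 20), to_listing_number(21, 20))
```

For a protein with no predicted signal peptide (signal_len of 0), both numbering schemes coincide.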

EXAMPLE 64 
Functional Analysis of Predicted Protein Sequences 
Following double-sequencing, new contigs were assembled for each of the 
extended cDNAs of the present invention and each was compared to known sequences 
available at the time of filing. These sequences originate from the following databases: 
Genbank (release 108 and daily releases up to October 15, 1998), Genseq (release 32), 
PIR (release 53) and SwissProt (release 35). The predicted proteins of the present 
invention matching known proteins were further classified into 3 categories depending 
on the level of homology. 

The first category contains proteins of the present invention exhibiting more 
than 70% identical amino acid residues on the whole length of the matched protein. 
They are clearly close homologues which most probably have the same function or a 
very similar function as the matched protein. 

The second category contains proteins of the present invention exhibiting more 
remote homologies (40 to 70% over the whole protein) indicating that the protein of the 
present invention may have functions similar to those of the homologous protein. 

The third category contains proteins exhibiting homology (90 to 100%) to a 
domain of a known protein indicating that the matched protein and the protein of the 
invention may share similar features. 
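
The three categories can be summarised as a small decision rule, sketched below. The thresholds are those given above; the treatment of a value of exactly 70%, the boolean flag distinguishing whole-protein from domain-only matches, and the function name are assumptions made for illustration.

```python
from typing import Optional


def homology_category(identity_pct: float, whole_protein: bool) -> Optional[int]:
    """Assign a match to category 1, 2 or 3, or None if it fits no category.

    whole_protein is True when the identity is measured over the whole
    matched protein, and False when it covers only a domain.
    """
    if whole_protein:
        if identity_pct > 70:
            return 1  # close homologue
        if identity_pct >= 40:
            return 2  # remote homologue
        return None
    if identity_pct >= 90:
        return 3  # homologous to a domain of a known protein
    return None


if __name__ == "__main__":
    print(homology_category(85, True), homology_category(55, True), homology_category(95, False))
```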

It should be noted that, in the numbering of amino acids in the protein sequences 
discussed in Figures 10 to 15 and Table VIII, the first methionine encountered is 
designated as amino acid number 1. In the appended sequence listing, the first amino 
acid of the mature protein resulting from cleavage of the signal peptide is designated as 
amino acid number 1, and the first amino acid of the signal peptide is designated with 
the appropriate negative number, in accordance with the regulations governing sequence 
listings. 

In addition, all of the corrected amino acid sequences (SEQ ID NOs: 141-241 
and 378-513) were scanned for the presence of known protein signatures and motifs. 
This search was performed against the Prosite 15.0 database, using the Proscan software 
from the GCG package. Functional signatures and their locations are indicated in 
Table VIII. 

A) Proteins which are closely related to known proteins 

Protein of SEQ ID NO: 217 (internal designation 48-46-4-A11-CL1 4) 

The protein of SEQ ID NO: 217 encoded by the extended cDNA SEQ ID NO: 
116 isolated from lymphocyte shows complete identity to a human protein TFAR19 that 
may play a role in apoptosis (Genbank accession number AF014955, SEQ ID NO: 516) 
as shown by the alignment in figure 10. 

Taken together, these data suggest that the protein of SEQ ID NO: 217 may be 
involved in the control of development and homeostasis. Thus, this protein may be 
useful in diagnosing and/or treating several types of disorders including, but not limited 
to, cancer, autoimmune disorders, viral infections such as AIDS, neurodegenerative 
disorders, and osteoporosis. 

Proteins of SEQ ID NOs: 174, 175 and 232 (internal designation 45-54-1-09) 
The proteins of SEQ ID NOs: 174, 175 and 232, encoded by the extended 
cDNAs SEQ ID NOs: 73, 74 and 131 respectively and isolated from lymphocytes, 
show extensive homologies to a human secreted protein (Genseq accession 
number W36955, SEQ ID NO: 517). As shown by the alignments of figure 11, the 
amino acid residues are identical to those of the 110 amino acid long matched protein 
except for positions 51 and 108-110 of the matched protein for the protein of SEQ ID 
NO: 174, for positions 48, 94 and 108-110 of the matched protein for the protein of SEQ 
ID NO: 175, and for positions 94 and 108-110 of the matched protein for the protein of 
SEQ ID NO: 232. Proteins of SEQ ID NOs: 174 and 232 may represent alternative forms 
issued from alternative use of polyadenylation signals. 

Taken together, these data suggest that the proteins of SEQ ID NOs: 174, 175 
and 232 may play a role in cell proliferation and/or differentiation, in immune responses 
and/or in haematopoiesis. Thus, these proteins, or parts thereof, may be useful in 
diagnosing and treating several disorders including, but not limited to, cancer, 
immunological, haematological and/or inflammatory disorders. They may also be useful in 
modulating the immune and inflammatory responses to infectious agents and/or to 
suppress graft rejection. 

Protein of SEQ ID NO: 231 (internal designation 47-4-4-C6-CL2 2) 
The protein of SEQ ID NO: 231 encoded by the extended cDNA SEQ ID NO: 
130 shows extensive homology with the human E25 protein (Genbank accession 
number AF038953, SEQ ID NO: 515). As shown by the alignments in figure 12, the 
amino acid residues are identical except for position 159 in the 263 amino acid long 
matched sequence. The matched protein might be involved in the development and 
differentiation of haematopoietic stem/progenitor cells. In addition, it is the human 
homologue of a murine protein thought to be involved in chondro-osteogenic 
differentiation and belonging to a novel multigene family of integral membrane proteins 
(Deleersnijder et al., J. Biol. Chem., 271: 19475-19482 (1996)). 

The protein of the invention contains two short transmembrane segments from 
positions 1 to 21 and from 100 to 120 as predicted by the software TopPred II (Claros 
and von Heijne, CABIOS applic. Notes, 10: 685-686 (1994)). The first transmembrane 
domain matches exactly that predicted for the murine E25 protein. 

Taken together, these data suggest that the protein of SEQ ID NO: 231 may be 
involved in cellular proliferation and differentiation. Thus, this protein may be useful in 
diagnosing and/or treating several types of disorders including, but not limited to, 
cancer and embryogenesis disorders. 

Protein of SEQ ID NO: 196 (internal designation 76-13-3-A9-CL1 2) 

The protein of SEQ ID NO: 196 encoded by the extended cDNA SEQ ID NO: 
95 shows extensive homology with the human seven transmembrane protein (Genbank 
accession number Y11395, SEQ ID NO: 518) and its murine homologue (Genbank 
accession number Y11550). As shown by the alignments in figure 13, the amino acid 
residues are identical except for position 174 in the 399 amino acid long human 
matched sequence. The matched protein, potentially associated with stomatin, may act as a 
G-protein coupled receptor and is likely to be important for signal transduction in 
neurons and haematopoietic cells (Mayer et al., Biochim. Biophys. Acta, 1395: 301-308 
(1998)). 

Taken together, these data suggest that the protein of SEQ ID NO: 196 may be 
involved in signal transduction. Thus, this protein may be useful in diagnosing and/or 
treating several types of disorders including, but not limited to, cancer, 
neurodegenerative diseases, cardiovascular disorders, hypertension, renal injury and 
repair and septic shock. 

Protein of SEQ ID NO: 158 (internal designation 33-35-4-G1-CL1 2) 

The protein of SEQ ID NO: 158 encoded by the extended cDNA SEQ ID NO: 
57 shows homology with the murine subunit 7a of the COP9 complex (Genbank 
accession number AF071316, SEQ ID NO: 519). As shown by the alignments in figure 
14, the amino acid residues are identical except for positions 90, 172 and 247 in the 275 
amino acid long matched sequence. This complex is highly conserved between 
mammals and higher plants, where it has been shown to act as a repressor of 
photomorphogenesis. All the components of the mammalian COP9 complex contain 
structural features also present in components of the proteasome regulatory complex and 
the translation initiation complex eIF3, suggesting that the mammalian COP9 
complex is an important cellular regulator modulating multiple signaling pathways (Wei 
et al., Curr. Biol., 8: 919-922 (1998)). 

Taken together, these data suggest that the protein of SEQ ID NO: 158 may be 
involved in cellular signaling, probably as a subunit of the human COP9 complex. 
Thus, this protein may be useful in diagnosing and/or treating several types of disorders 
including, but not limited to, cancer, neurodegenerative diseases, cardiovascular 
disorders, hypertension, renal injury and repair and septic shock. 

Protein of SEQ ID NO: 226 (internal designation 19-10-1-C2-CL1 3) 

The protein of SEQ ID NO: 226 encoded by the extended cDNA SEQ ID NO: 
125 shows homology with the bovine subunit B14.5B of the NADH-ubiquinone 
oxidoreductase complex (Arizmendi et al., FEBS Lett., 313: 80-84 (1992) and Swissprot 
accession number Q02827, SEQ ID NO: 514). As shown by the alignments in figure 
15, the amino acid residues are identical except for positions 3-4, 6-12, 32-34, 47, 53- 
55, 67 and 69-74 in the 120 amino acid long matched sequence. This complex is the 
first of four complexes located in the inner mitochondrial membrane and composing the 
mitochondrial electron transport chain. Complex I is involved in the dehydrogenation 
of NADH and the transportation of electrons to coenzyme Q. It is composed of 7 
subunits encoded by the mitochondrial genome and 34 subunits encoded by the nuclear 
genome. It is also thought to play a role in the regulation of apoptosis and necrosis. 
Mitochondriocytopathies due to complex I deficiency are frequently encountered and 
affect tissues with a high energy demand such as brain (mental retardation, convulsions, 
movement disorders), heart (cardiomyopathy, conduction disorders), kidney (Fanconi 
syndrome), skeletal muscle (exercise intolerance, muscle weakness, hypotonia) and/or 
eye (ophthalmoplegia, ptosis, cataract and retinopathy). For a review on complex I see 
Smeitink et al., Hum. Mol. Genet., 7: 1573-1579 (1998). 

Taken together, these data suggest that the protein of SEQ ID NO: 226 may be 
part of the mitochondrial energy-generating system, probably as a subunit of the 
NADH-ubiquinone oxidoreductase complex. Thus, this protein, or part thereof, may be 
useful in diagnosing and/or treating several disorders including, but not limited to, brain 
disorders (mental retardation, convulsions, movement disorders), heart disorders 
(cardiomyopathy, conduction disorders), kidney disorders (Fanconi syndrome), skeletal 
muscle disorders (exercise intolerance, muscle weakness, hypotonia) and/or eye 
disorders (ophthalmoplegia, ptosis, cataract and retinopathy). 

B) Proteins which are remotely related to proteins with known functions 

Proteins of SEQ ID NOs: 149, 150 and 211 (internal designation 27-1-2-B3-CL0 x p) 

The proteins of SEQ ID NOs: 149, 150 and 211, encoded by the extended 
cDNAs SEQ ID NOs: 48, 49 and 110 respectively and found in skeletal muscle, show 
homologies with T1/ST2 ligand polypeptide of either human (Genbank accession 
number U41804 and Genseq accession number W09639) or rodent species (Genbank 
accession number U41805 and Genseq accession number W09640). These 
polypeptides are thought to be cytokines that bind to the ST2 receptor, a member of the 
immunoglobulin family homologous to the interleukin-1 receptor and present on some 
lymphoma cells. They are predicted to be cell-surface proteins containing a short 
transmembrane domain (Gayle et al., J. Biol. Chem., 271: 5784-5789 (1996)). Proteins 
of SEQ ID NOs: 149, 150 and 211 may represent alternative forms issued from 
alternative use of polyadenylation signals. 

The protein of the invention contains two short transmembrane segments from 
positions 5 to 25 and from 195 to 215 as predicted by the software TopPred II (Claros 
and von Heijne, CABIOS applic. Notes, 10: 685-686 (1994)). The second 
transmembrane domain matches exactly that of the matched cell-surface protein. 

Taken together, these data suggest that the proteins of SEQ ID NOs: 149, 150 and 
211 may act as cytokines, and thus may play a role in the regulation of cell growth and 
differentiation and/or in the regulation of the immune response. Thus, these proteins, or 
parts thereof, may be useful in diagnosing and treating several disorders including, but 
not limited to, cancer, immunological, haematological and/or inflammatory disorders. 
They may also be useful in modulating the immune and inflammatory responses to 
infectious agents such as HIV and/or to suppress graft rejection. 

Protein of SEQ ID NO: 177 (internal designation 51-11-3-D5-CL1 3) 
The protein of SEQ ID NO: 177, found in testis and encoded by the extended cDNA 
SEQ ID NO: 76, shows homologies to serine protease inhibitor proteins belonging to the 
pancreatic trypsin inhibitor family (Kunitz) such as the extracellular proteinase inhibitor 
named chelonianin (Swissprot accession number P00993). The characteristic PROSITE 
signature of this family is conserved in the protein of the invention (positions 69 to 87) 
except for a drastic change of the last cysteine residue into an arginine residue. 

Taken together, these data suggest that the protein of SEQ ID NO: 177 may be a 
protease inhibitor, probably of the Kunitz family. Thus, this protein, or part thereof, may 
be useful in diagnosing and treating several disorders including, but not limited to, 
cancer and neurodegenerative disorders such as Alzheimer's disease. 

Protein of SEQ ID NO: 146 (internal designation 26-27-3-D7-CL0 D

The protein of SEQ ID NO: 146 encoded by the extended cDNA SEQ ID NO: 45
shows homology to human apolipoprotein L (Genbank accession number AF019225).
The matched protein is a secreted high density lipoprotein associated with
apoA-I-containing lipoproteins which play a key role in reverse cholesterol transport.

Taken together, these data suggest that the protein of SEQ ID NO: 146 may play
a role in lipid metabolism. Thus, this protein may be useful in diagnosing and/or
treating several types of disorders including, but not limited to, hyperlipidemia,
hypercholesterolemia, atherosclerosis, cardiovascular disorders such as coronary heart
disease, and neurodegenerative disorders such as Alzheimer's disease or dementia.

Protein of SEQ ID NO: 163 (internal designation 33-49-1-H4-CL1 n
The protein of SEQ ID NO: 163 encoded by the extended cDNA SEQ ID NO: 62
shows homology to the yeast autophagocytosis protein AUT1 (SwissProt accession
number P40344). The matched protein is required for starvation-induced non-specific
bulk transport of cytoplasmic proteins to the vacuole.

Taken together, these data suggest that the protein of SEQ ID NO: 163 may play
a role in protein transport. Thus, this protein may be useful in diagnosing and/or
treating several types of disorders including, but not limited to, autoimmune disorders
and immune disorders due to dysfunction of antigen presentation.

C) Proteins homologous to a domain of a protein with known function

Protein of SEQ ID NO: 214 (internal designation 33-28-4-D1-CL0 1 p)

The protein of SEQ ID NO: 214 encoded by the extended cDNA SEQ ID NO:
113 and expressed in adult brain shows extensive homology to part of the murine
SHYC protein (Genbank accession number AF072697) which is expressed in the
developing and embryonic nervous system as well as along the olfactory pathway in
adult brains (Koster et al., Neuroscience Letters, 252:69-71 (1998)).

Taken together, these data suggest that the protein of SEQ ID NO: 214 may play
a role in nervous system development and function. Thus, this protein may be useful in
diagnosing and/or treating cancer and/or brain disorders, including neurodegenerative
disorders such as Alzheimer's and Parkinson's diseases.

Protein of SEQ ID NO: 225 (internal designation 78-8-3-E6-CL0 1)

The protein of SEQ ID NO: 225 encoded by the extended cDNA SEQ ID NO:
124 and expressed in adult prostate belongs to the phosphatidylethanolamine-binding
protein family, of which it exhibits the characteristic PROSITE signature from positions 90
to 112 (see Table VIII). Proteins from this widespread family, found from nematodes to fly,
yeast, rodent and primate species, bind hydrophobic ligands such as phospholipids and
nucleotides. They are mostly expressed in brain and in testis and are thought to play a
role in cell growth and/or maturation, in regulation of sperm maturation and motility,
and in membrane remodeling. They may act either through signal transduction or
through oxidoreduction reactions (for a review see Schoentgen and Jollès, FEBS
Letters, 369:22-26 (1995)).

Taken together, these data suggest that the protein of SEQ ID NO: 225 may play
a role in cell growth, maturation and in membrane remodeling and/or may
be related to male fertility. Thus, this protein may be useful in diagnosing and/or
treating cancer, neurodegenerative diseases, and/or disorders related to male fertility
and sterility.

Protein of SEQ ID NO: 153 (internal designation 33-10-4-H2-CL2 2)

The protein of SEQ ID NO: 153 encoded by the extended cDNA SEQ ID NO:
52 and expressed in brain exhibits homology to different integral membrane proteins.
These membrane proteins include the nematode protein SRE-2 (Swissprot accession
number Q09273) that belongs to the multigene SRE family of C. elegans receptor-like
proteins and a family of tricarboxylate carriers conserved between flies and mammals.
One member of this matched family is the rat tricarboxylate carrier (Genbank accession
number S70011), an anion transporter localized in the inner membrane of mitochondria
and involved in the biosynthesis of fatty acids and cholesterol. The protein of the
invention contains a short transmembrane segment from positions 5 to 25 as predicted
by the software TopPred II (Claros and von Heijne, CABIOS applic. Notes, 10:685-686
(1994)).

Taken together, these data suggest that the protein of SEQ ID NO: 153 may play
a role in signal transduction and/or in molecule transport. Thus, this protein may be
useful in diagnosing and/or treating several types of disorders including, but not limited
to, cancer, neurodegenerative diseases, immune disorders, cardiovascular disorders,
hypertension, renal injury and repair and septic shock.

Protein of SEQ ID NO: 213 (internal designation 33-106-2-F10-CL1 3)

The protein of SEQ ID NO: 213 encoded by the extended cDNA SEQ ID NO:
112 and expressed in brain exhibits homology with part of the tRNA pseudouridine 55
synthase found in Escherichia coli (Swissprot accession number P09171). This
bacterial protein belongs to the NAP57/CBF5/TRUB family of nucleolar proteins found
in bacteria, yeasts and mammals involved in rRNA or tRNA biosynthesis, ribosomal
subunit assembly and/or centromere/microtubule binding.

Taken together, these data suggest that the protein of SEQ ID NO: 213 may play
a role in rRNA or tRNA biogenesis and function. Thus, this protein may be useful in
diagnosing and/or treating several types of disorders including, but not limited to,
cancer, hearing loss or disorders linked to chromosomal instability such as dyskeratosis.

Protein of SEQ ID NO: 240 (internal designation 78-21-3-G7-CL2 I)

The protein of SEQ ID NO: 240 encoded by the extended cDNA SEQ ID NO:
139 and expressed in brain exhibits homology with a family of eukaryotic cell surface
antigens containing 4 transmembrane domains. The PROSITE signature for this family
is conserved in the protein of the invention except for a substitution of an alanine
residue in place of any of the following hydrophobic residues: leucine, valine, isoleucine
or methionine (positions 21 to 36).

The protein of the invention contains three short transmembrane segments from
positions 6 to 26, 32 to 52 and from 56 to 76 as predicted by the software TopPred II
(Claros and von Heijne, CABIOS applic. Notes, 10:685-686 (1994)). These
transmembrane domains match the last three transmembrane domains of the matched
protein family.

Taken together, these data suggest that the protein of SEQ ID NO: 240 may play
a role in immunological and/or inflammatory responses, probably as a cell surface
antigen. Thus, this protein, or part thereof, may be useful in diagnosing and treating
several disorders including, but not limited to, cancer, immunological, haematological
and/or inflammatory disorders. It may also be useful in modulating the immune and
inflammatory responses to infectious agents and/or to suppress graft rejection.

Protein of SEQ ID NO: 239 (internal designation 76-30-3-B7-CL1 n

The protein of SEQ ID NO: 239 encoded by the extended cDNA SEQ ID NO:
138 exhibits homology with a conserved region in a family of Na+/H+ exchangers
conserved in yeast, nematode and mammals. These cation/proton exchangers are
integral membrane proteins with 5 transmembrane segments involved in intracellular
pH regulation, maintenance of cell volume, reabsorption of sodium across specialized
epithelia and vectorial transport, and are also thought to play a role in signal transduction
and especially in the induction of cell proliferation and in the induction of apoptosis.

The protein of the invention contains four short transmembrane segments from
positions 21 to 41, 48 to 68 and from 131 to 151 as predicted by the software TopPred II
(Claros and von Heijne, CABIOS applic. Notes, 10:685-686 (1994)). The third and
fourth transmembrane domains match the fourth and fifth transmembrane segments of
the matched family of proteins.

Taken together, these data suggest that the protein of SEQ ID NO: 239 may play
a role in membrane permeability and/or in signal transduction. Thus, this protein may
be useful in diagnosing and/or treating several types of disorders including, but not
limited to, cancer, neurodegenerative diseases, cardiovascular disorders, hypertension,
renal injury and repair, septic shock as well as disorders of membrane permeability such
as diarrhea.

Protein of SEQ ID NO: 200 (internal designation 77-16-4-G3-CL1 3)

The protein of SEQ ID NO: 200 encoded by the extended cDNA SEQ ID NO:
99 and expressed in brain exhibits extensive homology to the N-terminus of cell
division cycle protein 23 (Genbank accession number AF053977) and also, to a lesser
extent, to its homologue in Saccharomyces cerevisiae. The matched protein is required
for chromosome segregation and is part of the anaphase-promoting complex necessary
for cell cycle progression to mitosis.

Taken together, these data suggest that the protein of SEQ ID NO: 200 may play
a role in cellular mitosis. Thus, this protein may be useful in diagnosing and/or treating
several types of disorders including, but not limited to, cancer and leukemia.

Protein of SEQ ID NO: 230 (internal designation 33-61-2-F6-CL0 2)

The protein of SEQ ID NO: 230 encoded by the extended cDNA SEQ ID NO:
129 exhibits extensive homology to the C-terminus of the eta subunit of T-complex
polypeptide 1 conserved from yeasts to mammals, and even complete identity with the
last 54 amino acid residues of the human protein (Genbank accession number
AF026292). The matched protein is a chaperonin which assists the folding of actins and
tubulins in eukaryotic cells upon ATP hydrolysis.

Taken together, these data suggest that the protein of SEQ ID NO: 230 may play
a role in the folding, transport, assembly and degradation of proteins. Thus, this protein
may be useful in diagnosing and/or treating several types of disorders including, but not
limited to, cancer, cardiovascular disorders, immune disorders, neurodegenerative
disorders, osteoporosis and arthritis.

Protein of SEQ ID NO: 167 (internal designation 47-14-1-C3-CL0 5)

The protein of SEQ ID NO: 167 encoded by the extended cDNA SEQ ID NO:
66 exhibits homology to a monkey pepsinogen A-4 precursor (Swissprot accession
number P27678) and to related members of the aspartyl protease family. The matched
protein belongs to a family of widely distributed proteolytic enzymes known to exist in
vertebrates, fungi, plants, retroviruses and some plant viruses.

Taken together, these data suggest that the protein of SEQ ID NO: 167 may play 
a role in the degradation of proteins. Thus, this protein may be useful in diagnosing 
and/or treating several types of disorders including, but not limited to, cancer, 
autoimmune disorders and immune disorders due to dysfunction of antigen presentation. 

Protein of SEQ ID NO: 179 (internal designation 51-15-4-A12-CL1 1 31

The protein of SEQ ID NO: 179 encoded by the extended cDNA SEQ ID NO:
78 found in testis exhibits homology to part of mammalian colipase precursors.
Colipases are secreted cofactors for pancreatic lipases that allow the lipase to anchor at
the water-lipid interface. Colipase plays a crucial role in the intestinal digestion and
absorption of dietary fats. The 5 cysteines characteristic of this protein family are
conserved in the protein of the invention although the colipase PROSITE signature is
not.

Taken together, these data suggest that the protein of SEQ ID NO: 179 may
play a role in lipid metabolism and/or in male fertility. Thus, this protein may be
useful in diagnosing and/or treating several types of disorders including, but not limited
to, hyperlipidemia, hypercholesterolemia, atherosclerosis, cardiovascular disorders such
as coronary heart disease, and neurodegenerative disorders such as Alzheimer's disease
or dementia, and disorders linked to male fertility.

Protein of SEQ ID NO: 227 (internal designation 33-11-1-B11-CL1 2)
The protein of SEQ ID NO: 227 encoded by the extended cDNA SEQ ID NO:
126 exhibits extensive homology to the ATP binding region of a whole family of
serine/threonine protein kinases belonging to the CDC2/CDC28 subfamily. The
PROSITE signature characteristic of this domain is present in the protein of the
invention from positions 10 to 34.

Taken together, these data suggest that the protein of SEQ ID NO: 158 may
bind ATP, and even be a protein kinase. Thus, this protein may be useful in diagnosing
and/or treating several types of disorders including, but not limited to, cancer,
neurodegenerative diseases, cardiovascular disorders, hypertension, renal injury and
repair and septic shock.

As discussed above, the extended cDNAs of the present invention or portions
thereof can be used for various purposes. The polynucleotides can be used to express
recombinant protein for analysis, characterization or therapeutic use; as markers for tissues
in which the corresponding protein is preferentially expressed (either constitutively or at a
particular stage of tissue differentiation or development or in disease states); as molecular
weight markers on Southern gels; as chromosome markers or tags (when labeled) to
identify chromosomes or to map related gene positions; to compare with endogenous
DNA sequences in patients to identify potential genetic disorders; as probes to hybridize
and thus discover novel, related DNA sequences; as a source of information to derive PCR
primers for genetic fingerprinting; for selecting and making oligomers for attachment to a
"gene chip" or other support, including for examination for expression patterns; to raise
anti-protein antibodies using DNA immunization techniques; and as an antigen to raise
anti-DNA antibodies or elicit another immune response. Where the polynucleotide
encodes a protein which binds or potentially binds to another protein (such as, for
example, in a receptor-ligand interaction), the polynucleotide can also be used in
interaction trap assays (such as, for example, that described in Gyuris et al., Cell 75:791-803
(1993)) to identify polynucleotides encoding the other protein with which binding
occurs or to identify inhibitors of the binding interaction.

The proteins or polypeptides provided by the present invention can similarly be
used in assays to determine biological activity, including in a panel of multiple proteins
for high-throughput screening; to raise antibodies or to elicit another immune response; as
a reagent (including the labeled reagent) in assays designed to quantitatively determine
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which
the corresponding protein is preferentially expressed (either constitutively or at a particular
stage of tissue differentiation or development or in a disease state); and, of course, to
isolate correlative receptors or ligands. Where the protein binds or potentially binds to
another protein (such as, for example, in a receptor-ligand interaction), the protein can be
used to identify the other protein with which binding occurs or to identify inhibitors of the
binding interaction. Proteins involved in these binding interactions can also be used to
screen for peptide or small molecule inhibitors or agonists of the binding interaction.

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in
the art. References disclosing such methods include without limitation "Molecular
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook,
J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to
Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds.,
1987.

Polynucleotides and proteins of the present invention can also be used as
nutritional sources or supplements. Such uses include without limitation use as a protein
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a
source of carbohydrate. In such cases the protein or polynucleotide of the invention can
be added to the feed of a particular organism or can be administered as a separate solid or
liquid preparation, such as in the form of powder, pills, solutions, suspensions or capsules.
In the case of microorganisms, the protein or polynucleotide of the invention can be
added to the medium in or on which the microorganism is cultured.

Although this invention has been described in terms of certain preferred
embodiments, other embodiments which will be apparent to those of ordinary skill in the
art in view of the disclosure herein are also within the scope of this invention.
Accordingly, the scope of the invention is intended to be defined only by reference to the
appended claims. All documents cited herein are incorporated herein by reference in their
entirety.

TABLE I



SEQ ID NO. in 
Present application 


Provisional Application Disclosing Sequence 


SEQ ID NO. in 
provisional application 


40 


U.S. Provisional Patent Application Serial No. 60/096,116, filed
Aug. 10, 1998 


51 


41 


U.S. Provisional Patent Application Serial No. 60/081,563, filed 
Apr. 13, 1998 


72 


42 


U.S. Provisional Patent Application Serial No. 60/096,116, filed 
Aug. 10, 1998 


52 


43 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


78 


44 


U.S. Provisional Patent Application Serial No. 60/081,563, filed
Dec. 17, 1997


177 


345


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


178 


346 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


179 


347 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


180 


348


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


181 




U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


182 




U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


183 


351 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


184 


352


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


185 


353 


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


186


354 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


187 


355 


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997 


188


356


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


189


357


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


190


358


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


191


359


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


192


360


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


193 


361 


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997 


194 


362 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


195 


363 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


196 


364 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


197 


365 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


198


366 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


199 


367 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


200 


368 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


201 


369


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997


202


370


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


203 




U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997 


204 


372 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


205 


373 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


206


374


U.S. Provisional Patent Application Serial No. 60/069,957, filed
Dec. 17, 1997 


207 


375 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


208 


376 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


209 


377 


U.S. Provisional Patent Application Serial No. 60/069,957, filed 
Dec. 17, 1997 


210 






TABLE II: Parameters used for each step of EST analysis

 | Search Characteristics | | | Selection Characteristics |
Step | Program | Strand | Parameters | Identity (%) | Length (bp)
Miscellaneous | Blastn | both | [illegible] | [illegible] | [illegible]
tRNA | Fasta | both |  | [illegible] | [illegible]
rRNA | Blastn | both | [illegible] | [illegible] | [illegible]
mtRNA |  | both | [illegible] | [illegible] | 
Procaryotic | Blastn | both | S=144 | 90 | 40
Fungal | Blastn | both | S=144 | 90 | 40
Alu | fasta* | both |  | 70 | 40
L1 | Blastn | both | S=72 | 70 | 40
Repeats | Blastn | both | S=72 | 70 | 40
Promoters | Blastn | top | S=54 X=16 | 90 | 15†
Vertebrate | fasta* | both | S=108 | 90 | 30
ESTs | Blastn | both | S=108 X=16 | 90 | 30
Proteins | blastx^ | top | E=0.001 |  | 

* use "Quick Fast" Database Scanner
† alignment further constrained to begin closer than 10 bp to EST 5' end
^ using BLOSUM62 substitution matrix
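
The selection characteristics in Table II can be read as thresholds applied to each database hit. The sketch below assumes that the Identity (%) and Length (bp) values are minimum values and that both must be met, which the table itself does not state; the `Hit` class, the function name and the example hits are hypothetical. Only rows that are legible in Table II are included.

```python
from dataclasses import dataclass

# (identity %, length bp) per step, copied from legible rows of Table II.
# Treating them as inclusive minimum thresholds is an assumption.
THRESHOLDS = {
    "Procaryotic": (90, 40),
    "Fungal": (90, 40),
    "Alu": (70, 40),
    "Repeats": (70, 40),
    "Vertebrate": (90, 30),
    "ESTs": (90, 30),
}


@dataclass
class Hit:
    step: str        # which screening step produced the hit
    identity: float  # percent identity over the aligned region
    length: int      # aligned length in bp


def passes_selection(hit):
    """True when the hit meets both thresholds for its step."""
    min_identity, min_length = THRESHOLDS[hit.step]
    return hit.identity >= min_identity and hit.length >= min_length


# Hypothetical hits, for illustration only.
print(passes_selection(Hit("Alu", 75.0, 52)))
print(passes_selection(Hit("ESTs", 88.0, 200)))
```

Keeping the thresholds in one dictionary makes it easy to see that the repeat screens (70%) are more permissive than the contaminant and EST screens (90%).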
TABLE III: Parameters used for each step of extended cDNA analysis





 | Search characteristics | | | Selection characteristics | |
Step | Program | Strand | Parameters | Identity (%) | Length (bp) | Comments
miscellaneous* | FASTA | both |  | [illegible] | 15 | 
tRNA^ | FASTA | both |  | [illegible] | [illegible] | 
rRNA^ | BLASTN | both | S=108 | 80 | 40 | 
mtRNA* | BLASTN | both | S=108 | 80 | 40 | 
Procaryotic^ | BLASTN | both | S=144 | 90 | 40 | 
Fungal* | BLASTN | both | S=144 | 90 | 40 | 
Alu* | BLASTN | both | S=72 | 70 | 40 | max 5 matches, masking
L1^ | BLASTN | both | S=72 | 70 | 40 | max 5 matches, masking
Repeats^ | BLASTN | both | S=72 | 70 | 40 | masking
PolyA | BLAST2N | top | W=6, S=10, E=1000 | 90 | 8 | in the last 20 nucleotides
Polyadenylation signal | - | top | AATAAA allowing 1 mismatch |  |  | in the 50 nucleotides preceding the 5' end of the polyA
Vertebrate* | BLASTN then FASTA | both |  | 90 then 70 | 30 | first BLASTN and then FASTA on matching sequences
ESTs* | BLAST2N | both |  | 90 | 30 | 
Geneseq | BLASTN | both | W=8, B=10 | 90 | 30 | 
ORF | BLASTP | top | W=8, B=10 |  |  | on ORF proteins, max 10 matches
Proteins* | BLASTX | top | E=0.001 | 70 | 30 | 

^ steps common to EST analysis and using the same algorithms and parameters
* steps also used in EST analysis but with different algorithms and/or parameters
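
The polyadenylation signal step in Table III (AATAAA allowing 1 mismatch, in the 50 nucleotides preceding the 5' end of the polyA) can be sketched as a short scan. The function name, the choice to require that the hexamer lie entirely inside the 50-nucleotide window, and the toy sequences are assumptions made for illustration; they are not taken from the application.

```python
def find_polya_signals(cdna, polya_start, window=50, motif="AATAAA", max_mismatches=1):
    """Return (1-based position, hexamer, mismatches) for every hexamer that
    differs from `motif` at no more than `max_mismatches` positions and lies
    within the `window` nucleotides preceding the poly(A) tail.

    `polya_start` is the 1-based position of the first tail nucleotide.
    """
    seq = cdna.upper()
    tail_index = polya_start - 1           # 0-based index of first tail base
    begin = max(0, tail_index - window)    # left edge of the search window
    hits = []
    for i in range(begin, tail_index - len(motif) + 1):
        hexamer = seq[i:i + len(motif)]
        mismatches = sum(a != b for a, b in zip(hexamer, motif))
        if mismatches <= max_mismatches:
            hits.append((i + 1, hexamer, mismatches))
    return hits


# Toy cDNA 3' ends (hypothetical): 40 nt of filler, a hexamer at 41-46,
# a 14-nt spacer, then a 15-nt poly(A) tail starting at position 61.
canonical = "GC" * 20 + "AATAAA" + "GC" * 7 + "A" * 15
variant = "GC" * 20 + "ATTAAA" + "GC" * 7 + "A" * 15

print(find_polya_signals(canonical, 61))
print(find_polya_signals(variant, 61))
```

Counting mismatches position by position (a Hamming distance) is the simplest reading of "allowing 1 mismatch"; insertions and deletions are not considered in this sketch.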
TABLE IV



Id 


FCS Location 


SigPep Location 


Mature Polypeptide 
Location 


Stop Codon 
Location 


PolyA Signal 
Location 


PolvA Site 
Location 


40 


7 through 471 


7 through 99 


100 through 471 


472 


537 through 542 


554 through 568 


41 


168 through 332 




168 through 332 


333 


557 through 562 


- 


42 


51 through 251 


51 through 110 


111 through 251 


252 


849 through 854 


882 through 895 


43 


20 through 613 


20 through 82 


83 through 613 


614 


- 


- 


44 


12 through 416 


12 through 86 


87 through 416 


417 


425 through 430 


445 through 458 


45 


276 through 1040 


276 through 485 


486 through 1040 


1041 


- 


2024 through 2036 


46 


443 through 619 


443 through 589 


590 through 619 


620 


- 


1267 through 1276 


47 


206 through 747 


- 


206 through 747 


- 


- 


- 


48 


36 through 521 


36 through 104 


105 through 521 


522 


528 through 533 


548 through 561 


49 


36 through 395 


36 through 104 


105 through 395 


396 


599 through 604 


619 through 632 


50 


21 through 41 


- 


21 through 41 


42 


328 through 333 


357 through 370 


51 


35 through 631 


35 through 160 


161 through 631 


632 


901 through 906 


979 through 994 


52 


271 through 399 


- 


271 through 399 


400 


- 


- 


53 


103 through 252 


103 through 213 


214 through 252 


253 


- 


588 through 597 


54 


2 through 460 


- 


2 through 460 


461 


713 through 718 


735 through 748 


55 


31 through 231 


- 


31 through 231 


232 


769 through 774 


690 through 703 


56 


305 through 565 


- 


305 through 565 


566 


694 through 699 


713 through 725 


57 


124 through 873 


124 through 378 


379 through 873 


874 


1673 through 1678 


1694 through 1705 


58 


135 through 206 


- 


135 through 206 


207 


850 through 855 


1056 through 1069 


59 


135 through 818 


- 


135 through 818 


819 


909 through 914 


1071 through 1084 


60 


33 through 290 


33 through 92 


93 through 290 


291 


- 


- 


61 


485 through 616 


- 


485 through 616 


617 


- 


669 through 682 


62 


54 through 995 


54 through 227 


228 through 995 


996 


1130 through 1135 


1181 through 1191 


63 


657 through 923 


657 through 896 


897 through 923 


924 


957 through 962 


974 through 1008 


64 


18 through 311 


18 through 62 


63 through 311 


312 


- 


- 


65 


151 through 426 


151 through 258 


259 through 426 


427 


505 through 510 


527 through 538 


66 


10 through 1062 


10 through 57 


58 through 1062 


1063 


1710 through 1715 


1735 through 1747 


67 


78 through 491 


78 through 218 


219 through 491 


492 


1652 through 1657 


1673 through 1686 


68 


69 through 371 


69 through 287 


288 through 371 


372 


510 through 515 


530 through 542 


69 


2 through 757 


2 through 205 


206 through 757 


758 




11 60 through 1174 


70 


2 through 1051 


2 through 205 


206 through 1051 


1052 


1248 through 1253 


1272 through 1285 


71 


2 through 1171 


2 through 205 


206 through 1171 


1172 


1368 through 1373 


1386 through 1398 


72 


42 through 611 


42 through 287 


288 through 611 


612 


787 through 792 


808 through 821 


73 


62 through 916 


62 through 757 


758 through 916 






904 through 916 


74 


62 through 520 




62 through 520 


521 


1124 through 1129 


1141 through 1153 
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75 


21 through 167 


- 


21 through 167 


168 


- 


- 


76 


22 through 318 


22 through 93 


94 through 318 


319 


497 through 502 


516 through 526 


77 


8 through 292 


8 through 118 


119 through 292 


293 


3 17 through 322 


339 through 352 


78 


16 through 378 


16 through 84 


85 through 378 


379 


502 through 507 


522 through 542 


79 


57 through 233 


- 


57 through 233 


- 


- 


- 


80 


83 through 340 


83 through 124 


125 through 340 


341 


573 through 578 


607 through 660 


81 


47 through 541 


47 through 220 


221 through 541 


542 


- 


597 through 605 


82 


46 through 285 


46 through 150 


151 through 285 


286 


364 through 369 


385 through 396 


83 


22 through 240 


22 through 84 


85 through 240 


241 


397 through 402 


421 through 432 


84 


89 through 382 


- 


89 through 382 


383 


- 


408 through 420 


85 


80 through 415 


80 through 142 


143 through 415 


416 


471 through 476 


488 through 501 


86 


152 through 361 


152 through 283 


284 through 361 


362 


- 


- 


87 


32 through 307 


32 through 70 


71 through 307 


308 


1240 through 1245 


1261 through 1272 


88 


1 14 through 734 


114 through 239 


240 through 734 


735 


768 through 773 


793 through 804 


89 


199 through 802 


- 


199 through 802 


- 


780 through 785 


791 through 802 


90 


38 through 1174 


38 through 148 


149 through 1174 


1175 


1452 through 1457 


1478 through 1490 


91 


26 through 361 


- 


26 through 361 


- 


- 


350 through 361 


92 


3 through 131 


- 


3 through 131 


132 


- 


591 through 605 


93 


33 through 185 


33 through 80 


81 through 185 


186 


570 through 575 


586 through 591 


94 


184 through 915 


184 through 237 


238 through 915 


916 


11 19 through 1124 


1139 through 1150 


95 


58 through 1116 


58 through 159 


160 through 1116 


1117 


1486 through 1491 


1504 through 1513 


96 


327 through 417 


- 


327 through 417 


- 


- 


404 through 417 


97 


63 through 398 


63 through 206 


207 through 398 


399 


- 


- 


98 


2 through 163 


- 


2 through 163 


164 


488 through 493 


511 through 522 


99 


13 through 465 


13 through 75 


76 through 465 


466 


- 


- 


100 


20 through 703 


20 through 94 


95 through 703 


704 


1000 through 1005 


1023 through 1041 


101 


103 through 294 


103 through 243 


244 through 294 


295 


- 


- 


102 


81 through 518 


81 through 173 


174 through 518 


519 


- 


- 


103 


66 through 326 


- 


66 through 326 


327 


1066 through 1071 


1087 through 1098 


104 


170 through 289 


170 through 250 


251 through 289 


290 


- 


- 


105 


36 through 497 


- 


36 through 497 


498 


650 through 655 


663 through 685 


106 


18 through 320 


- 


18 through 320 


321 


539 through 544 


542 through 554 


107 


71 through 1438 


71 through 136 


137 through 1438 


1439 


1644 through 1649 


1665 through 1678 


108 


25 through 318 


25 through 75 


76 through 318 


319 


452 through 457 


482 through 494 


109 


84 through 332 


84 through 170 


171 through 332 


333 




702 through 714 


110 


32 through 718 


32 through 100 


101 through 718 


719 


770 through 775 


793 through 805 


111 


26 through 481 


26 through 88 


89 through 481 


482 


755 through 760 


775 through 787 
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112 


26 through 562 


26 through 187 


188 through 562 


563 


- 


- 


113 


4 through 810 


4 through 279 


280 through 810 


811 


858 through 863 


881 through 893 


114 


55 through 459 


55 through 120 


121 through 459 


460 


1444 through 1449 


1462 through 1475 


115 


48 through 248 


48 through 161 


162 through 248 


249 


283 through 288 


308 through 321 


116 


25 through 399 


25 through 186 


187 through 399 


400 


- 


- 


117 


10 through 1137 


10 through 72 


73 through 1137 


1138 


1144 through 1149 


1162 through 1173 


118 


72 through 704 


72 through 161 


162 through 704 


705 


772 through 777 


- 


119 


44 through 505 


44 through 223 


224 through 505 


506 


- 


- 


120 


25 through 393 


25 through 150 


151 through 393 


394 


734 through 739 


757 through 770 


121 


58 through 1095 


58 through 114 


115 through 1095 


1096 


- 


1202 through 1213 


122 


31 through 660 


31 through 90 


91 through 660 


661 


1288 through 1293 


1307 through 1318 


123 


31 through 582 


31 through 90 


91 through 582 


583 


816 through 821 


840 through 853 


124 


15 through 695 


15 through 80 


81 through 695 


696 


795 through 800 


814 through 826 


125 


74 through 295 


74 through 196 


197 through 295 


296 


545 through 550 


561 through 571 


126 


440 through 659 


- 


440 through 659 


- 


601 through 606 




127 


38 through 283 


38 through 85 


86 through 283 


284 


257 through 262 




128 


121 through 477 


121 through 288 


289 through 477 


- 


- 


-


129 


2 through 163 


- 


2 through 163 


164 


292 through 297 


310 through 323 


130 


46 through 675 


46 through 87 


88 through 675 


676 


1364 through 1369 


1383 through 1392 


131 


62 through 385 


- 


62 through 385 


386 


974 through 979 


987 through 999 


132 


422 through 550 


422 through 475 


476 through 550 


551 


- 


714 through 725 


133 


124 through 231 


- 


124 through 231 


232 


- 


387 through 400 


134 


131 through 1053 


131 through 169 


170 through 1053 


- 


1019 through 1024 


-


135 


86 through 403 


86 through 181 


182 through 403 


404 


1097 through 1102 


1117 through 1128


136 


37 through 162 


37 through 93 


94 through 162 


163 


224 through 229 


243 through 254 


137 


31 through 381 


31 through 90 


91 through 381 


382 


- 


875 through 886 


138 


46 through 579 


46 through 156 


157 through 579 


580 


- 


- 


139 


92 through 471 


92 through 172 


173 through 471 


- 


454 through 459 


458 through 471 


140 


154 through 675 


154 through 498 


499 through 675 


676 


819 through 824 


838 through 849 


242 


18 through 173 


18 through 77 


78 through 173 


174 


864 through 869 


882 through 893 


243 


17 through 595 


17 through 85 


86 through 595 


596 


820 through 825 


840 through 851 


244 


89 through 334 


89 through 130 


131 through 334 


335 


462 through 467 


484 through 495 


245 


21 through 614 


21 through 83 


84 through 614 


615 


849 through 854 


873 through 884 


246 


94 through 573 


94 through 258 


259 through 573 


574 


862 through 867 


886 through 897 


247 


74 through 397 


74 through 127 


128 through 397 


398 


472 through 477 


507 through 518 


248 


51 through 242 


51 through 116 


117 through 242 


243 


319 through 324 


339 through 350 


249 


111 through 191 


111 through 155 


156 through 191 


192 


965 through 970 


986 through 996


250


45 through 602 


45 through 107 


108 through 602 


603 


828 through 833 


850 through 860 


251 


24 through 560 


24 through 101 


102 through 560 


561 


563 through 568 


583 through 593 


252 


109 through 558 


109 through 273 


274 through 558 


559 


- 


1104 through 1114 


253 


128 through 835 


128 through 220 


221 through 835 


836 


1145 through 1150 


1170 through 1181 


254 


59 through 505 


59 through 358 


359 through 505 


506 


1042 through 1047 


1062 through 1073 


255 


1 through 207 


1 through 147 


148 through 207 


208 


784 through 789 


807 through 818 


256 


12 through 734 


12 through 101 


102 through 734 


735 


914 through 919 


961 through 971 


257 


378 through 518 


378 through 467 


468 through 518 


519 


607 through 612 


628 through 640 


258 


110 through 304 


110 through 193 


194 through 304 


305 


708 through 713 


732 through 743 


259 


201 through 419 


201 through 272 


273 through 419 


420 


601 through 606 


627 through 637 


260 


123 through 302 


123 through 176 


177 through 302 


303 


1279 through 1284 


1301 through 1312 


261 


98 through 673 


98 through 376 


377 through 673 


674 


- 


1025 through 1035 


262 


17 through 463 


17 through 232 


233 through 463 


464 


657 through 662 


684 through 696 


263 


263 through 481 


263 through 322 


323 through 481 


482 


- 


858 through 868 


264 


42 through 299 


42 through 101 


102 through 299 


300 


- 


762 through 775 


265 


198 through 431 


198 through 260 


261 through 431 


432 


- 


1064 through 1074 


266 


279 through 473 


279 through 362 


363 through 473 


474 


944 through 949 


970 through 981 


267 


12 through 644 


12 through 92 


93 through 644 


645 


1002 through 1007 


1020 through 1031 


268 


91 through 459 


91 through 330 


331 through 459 


460 


- 


1271 through 1281 


269 


70 through 327 


70 through 147 


148 through 327 


328 


1741 through 1746 


1763 through 1774 


270 


12 through 497 


12 through 104 


105 through 497 


498 


935 through 940 


955 through 967 


271 


90 through 383 


90 through 200 


201 through 383 


384 


609 through 614 


632 through 643 


272 


332 through 541 


332 through 376 


377 through 541 


542 


739 through 744 


761 through 773 


273 


43 through 222 


43 through 177 


178 through 222 


223 


530 through 535 


555 through 566 


274 


115 through 231 


115 through 180 


181 through 231 


232 


419 through 424 


445 through 455 


275 


232 through 384 


232 through 300 


301 through 384 


385 


650 through 655 


662 through 673 


276 


143 through 427 


143 through 286 


287 through 427 


428 


606 through 611 


628 through 639 


277 


284 through 463 


284 through 379 


380 through 463 


464 


- 


762 through 772 


278 


162 through 671 


162 through 398 


399 through 671 


672 


805 through 810 


830 through 840 


279 


63 through 632 


63 through 308 


309 through 632 


633 


808 through 813 


829 through 840 


280 


21 through 362 


21 through 200 


201 through 362 


363 


821 through 826 


838 through 849 


281 


21 through 503 


21 through 344 


345 through 503 


504 


1305 through 1310 


1330 through 1341 


282 


1 through 201 


1 through 63 


64 through 201 


202 


637 through 642 


660 through 671 


283 


39 through 1034 


39 through 134 


135 through 1034 


1035 


1566 through 1571 


1587 through 1597 


284 


69 through 263 


69 through 125 


126 through 263 


264 


1173 through 1178 


1196 through 1205


285 


115 through 285 


115 through 204 


205 through 285 


286 


505 through 510 


525 through 536 


286 


90 through 344 


90 through 140 


141 through 344 


345 


500 through 505 


515 through 527


287


57 through 311 


57 through 107 


108 through 311 


312 


467 through 472 


482 through 493 


288 


96 through 302 


96 through 182 


183 through 302 


303 


- 


501 through 514 


289 


161 through 526 


161 through 328 


329 through 526 


527 


- 


799 through 811 


290 


210 through 332 


210 through 299 


300 through 332 


333 


594 through 599 


613 through 625 


291 


212 through 361 


212 through 319 


320 through 361 


362 


650 through 655 


673 through 684 


292 


75 through 482 


75 through 128 


129 through 482 


483 


595 through 600 


618 through 627 


293 


50 through 631 


50 through 244 


245 through 631 


632 


777 through 782 


801 through 812 


294 


154 through 576 


154 through 360 


361 through 576 


577 


737 through 742 


763 through 775 


295 


154 through 897 


154 through 360 


361 through 897 


898 


1017 through 1022


1044 through 1054 


296 


146 through 292 


146 through 253 


254 through 292 


293 


395 through 400 


433 through 444 


297 


126 through 383 


126 through 167 


168 through 383 


384 


726 through 731 


743 through 754 


298 


66 through 497 


66 through 239 


240 through 497 


498 


594 through 599 


618 through 629 


299 


49 through 411 


49 through 96 


97 through 411 


412 


732 through 737 


750 through 763 


300 


49 through 534 


49 through 96 


97 through 534 


535 


593 through 598 


612 through 623 


301 


86 through 415 


86 through 145 


146 through 415 


416 


540 through 545 


560 through 571 


302 


56 through 268 


56 through 100 


101 through 268 


269 


584 through 589 


601 through 612 


303 


32 through 328 


32 through 103 


104 through 328 


329 


508 through 513 


528 through 539 


304 


21 through 527 


21 through 95 


96 through 527 


528 


921 through 926 


953 through 963 


305 


147 through 647 


147 through 374 


375 through 647 


648 


-


668 through 681 


306 


262 through 471 


262 through 306 


307 through 471 


472 


663 through 668 


682 through 693 


307 


74 through 1216 


74 through 172 


173 through 1216 


1217 


1627 through 1632 


1640 through 1652 


308 


48 through 164 


48 through 89 


90 through 164 


165 


482 through 487 


505 through 517 


309 


185 through 334 


185 through 295 


296 through 334 


335 


355 through 360 


392 through 405 


310 


195 through 347 


195 through 272 


273 through 347 


348 


1037 through 1042 


1071 through 1082 


311 


90 through 815 


90 through 179 


180 through 815 


816 


883 through 888 


905 through 916 


312 


52 through 513 


52 through 231 


232 through 513 


514 


553 through 558 


572 through 583 


313 


172 through 438 


172 through 354 


355 through 438 


439 


682 through 687 


685 through 697 


314 


148 through 366 


148 through 225 


226 through 366 


367 


770 through 775 


792 through 803 


315 


175 through 336 


175 through 276 


277 through 336 


337 


- 


812 through 823 


316 


191 through 553 


191 through 304 


305 through 553 


554 


766 through 771 


804 through 817 


317 


106 through 603 


106 through 216 


217 through 603 


604 


- 


1102 through 1112 


318 


47 through 586 


47 through 124 


125 through 586 


587 


1583 through 1588 


1614 through 1623 


319 


99 through 371 


99 through 290 


291 through 371 


372 


491 through 496 


513 through 524 


320 


44 through 814 


44 through 112 


113 through 814 


815 




978 through 989 


321 


3 through 581 


3 through 182 


183 through 581 


582 




1006 through 1016 


322 


107 through 427 


107 through 190 


191 through 427 


428 


499 through 504 


516 through 529 


323 


45 through 407 


45 through 83 


84 through 407 


408 


1008 through 1013 


1032 through 1042


324


201 through 332 


201 through 251 


252 through 332 


333 


- 


869 through 880 


325 


217 through 543 


217 through 255 


256 through 543 


544 


- 


1206 through 1217 


326 


18 through 446 


18 through 140 


141 through 446 


447 


930 through 935 


948 through 959 


327 


29 through 724 


29 through 118 


119 through 724 


725 


886 through 891 


910 through 920 


328 


404 through 586 


404 through 466 


467 through 586 


587 


1304 through 1309 


1334 through 1344 


329 


331 through 432 


331 through 387 


388 through 432 


433 


548 through 553 


573 through 585 


330 


59 through 703 


59 through 220 


221 through 703 


704 


886 through 891 


903 through 914 


331 


672 through 752 


672 through 722 


723 through 752 


753 


- 


1150 through 1161 


332 


57 through 311 


57 through 128 


129 through 311 


312 


332 through 337 


351 through 363 


333 


80 through 232 


80 through 127 


128 through 232 


233 


617 through 622 


634 through 645 


334 


91 through 291 


91 through 219 


220 through 291 


292 


367 through 372 


389 through 400 


335 


196 through 384 


196 through 240 


241 through 384 


385 


461 through 466 


485 through 496 


336 


54 through 590 


54 through 227 


228 through 590 


591 


- 


955 through 965 


337 


133 through 846 


133 through 345 


346 through 846 


847 


- 


890 through 901 


338 


138 through 671 


138 through 248 


249 through 671 


672 


1319 through 1324 


1338 through 1347 


339 


124 through 411 


124 through 186 


187 through 411 


412 


948 through 953 


971 through 983 


340 


372 through 494 


372 through 443 


444 through 494 


495 


708 through 713 


732 through 745 


341 


112 through 450 


112 through 192 


193 through 450 


451 


1053 through 1058 


1095 through 1106 


342 


117 through 866


117 through 170 


171 through 866 


867 


1159 through 1164 


1178 through 1190 


343 


13 through 465 


13 through 75 


76 through 465 


466 


1035 through 1040 


1060 through 1070 


344 


2 through 718 


2 through 76 


77 through 718 


719 


1170 through 1175 


1203 through 1213 


345 


86 through 709 


86 through 361 


362 through 709 


710 


943 through 948 


963 through 973 


346 


63 through 320 


63 through 179 


180 through 320 


321 


771 through 776 


799 through 810 


347 


299 through 418 


299 through 379 


380 through 418 


419 


739 through 744 


762 through 771 


348 


186 through 380 


186 through 233 


234 through 380 


381 


383 through 388 


396 through 409 


349 


69 through 458 


69 through 233 


234 through 458 


459 


564 through 569 


602 through 613 


350 


12 through 638 


12 through 263 


264 through 638 


639 


951 through 956 


975 through 985 


351 


282 through 389 


282 through 332 


333 through 389 


390 


1413 through 1418 


1437 through 1447 


352 


208 through 339 


208 through 294 


295 through 339 


340 


- 


1631 through 1641 


353 


69 through 557 


69 through 224 


225 through 557 


558 


849 through 854 


870 through 883 


354 


134 through 325 


134 through 274 


275 through 325 


326 


- 


718 through 729 


355 


78 through 731 


78 through 227 


228 through 731 


732 


- 


1002 through 1013 


356 


46 through 693 


46 through 90 


91 through 693 


694 


937 through 942 


962 through 973 


357 


126 through 527 


126 through 182 


183 through 527 


528 


834 through 839 


856 through 867 


358 


66 through 320 


66 through 113 


114 through 320 


321 


490 through 495 


508 through 519 


359 


73 through 948 


73 through 159 


160 through 948 


949 




1016 through 1028 


360 


69 through 434 


69 through 236 


237 through 434 


435 


419 through 424 


441 through 452


361


628 through 804 


628 through 711 


712 through 804 


805 


- 


864 through 875 


362 


70 through 366 


70 through 108 


109 through 366 


367 


496 through 501 


521 through 531 


363 


70 through 366 


70 through 108 


109 through 366 


367 


- 


1233 through 1244 


364 


111 through 434 


111 through 185 


186 through 434 


435 


- 


618 through 631 


365 


19 through 567 


19 through 63 


64 through 567 


568 


749 through 754 


771 through 781 


366 


19 through 312 


19 through 63 


64 through 312 


313 


896 through 901 


921 through 931 


367 


64 through 612 


64 through 234 


235 through 612 


613 


- 


839 through 849 


368 


39 through 458 


39 through 80 


81 through 458 


459 


613 through 618 


633 through 644 


369 


9 through 185 


9 through 50 


51 through 185 


186 


- 


906 through 918 


370 


14 through 316


14 through 121 


122 through 316 


317 


442 through 447 


458 through 471 


371 


70 through 1092 


70 through 234 


235 through 1092 


1093 


1475 through 1480 


1493 through 1504 


372 


274 through 597 


274 through 399 


400 through 597 


598 


731 through 736 


754 through 765 


373 


230 through 469 


230 through 307 


308 through 469 


470 


1004 through 1009 


1027 through 1040 


374


72 through 545 


72 through 203 


204 through 545 


546 




1151 through 1162 


375 


36 through 425 


36 through 119 


120 through 425 


426 


1215 through 1220 


1240 through 1250 


376 


155 through 751 


155 through 340 


341 through 751 


752 


912 through 917 


937 through 947 


377 


46 through 585 


46 through 120 


121 through 585 


586 


584 through 589 


606 through 619


Id


Full Length Polypeptide Location


Signal Peptide Location


Mature Polypeptide Location






141 


-'^l throiifrh 174 


-^1 tfirr^ncrli —1 
"Ji UUKJKl^l -1 


1 Liirougn 1Z4 


142 


1 through 55




1 through 55


143 


-20 through 47


-20 through -1


1 through 47


144 


-21 through 177


-21 through -1


1 through 177


145 


-25 through 110


-25 through -1


1 through 110




-70 thrnnah 1 SS 


-70 tVirr\iifTli 1 

- / u inrougn - 1 


1 tnrougn loj 




-49 through 10


-49 through -1


1 through 10




1 through 180




1 through 180 


MO 


-zj inrougii ijy 


-Z3 inrougn -i 


1 through 1J9 


ISO 


-Zj llliUUgll / 


-Zj inrougn -i 


1 through y / 


151


1 through 7




1 through 7 




-42 through 157


-42 through -1


1 through 157 




1 through 43




1 through 43 


1 


-J / lnrougn i j 


-i / through - 1 


1 through 13 




1 through 153




1 through 153 


156


1 through 67




1 through 67 


157


1 through 87 




1 through 87 


158


-85 through 165 


-85 through -1 


1 through 165 


159


1 through 24




1 through 24 




1 through 228




1 through 228 


161


-20 through 66


-20 through -1


1 through 66 


162


1 through 44




1 through 44 


163


-58 through 256


-58 through -1 


1 through 256 


164


-ou inrougn y 


-oU through -1 


1 through 9 


165


-ID tnrougn oj 


-1 J through -1 


1 through 83 


166


-36 through 56 


-35 through -1 


1 through 56 


167


-16 through 335


-16 through -1 


1 through 335 


168


-47 through 91


-47 through -1 


1 through 91 


169


- /j inrougn zo 


-/j through -1 


1 through 28 


170


-Oo inrougn i o4 


-Do tnrougn -i 


1 through 184 


171


-AS tVirrmcrli 9S9 
-Oo Liirougu ZOZ 


-Oo tnrougn -i 


1 through ZOZ 


172


AS flirrtiifrfi '599 

-Oo tnrougn jzz 


-oo tnrougn -1 


1 through 322 


173


-S9 tlirnnoli 1 OS 

-oz uirougn luo 


-oz tnrougn -i 


1 through lOo 


174 


-zjz imuugn jj 


-zjz inrougn -i 


1 through d5 


175


1 through 153




1 through 153 


176 


1 through 49 




1 through 49


177 


-24 through 75 


-24 through -1 


1 through 75 


178 


-37 through 58 


-37 through -1 


1 through 58 


179 


-23 through 98 


-23 through -1 


1 through 98 


180 


1 through 59 




1 through 59 


181 


-14 through 72 


-14 through -1 


1 through 72 


182 


-58 through 107 


-58 through -1 


1 through 107 


183 


-35 through 45 


-35 through -1 


1 through 45


184


-21 through 52 


-21 through -1


1 through 52


185 


1 throush 98 




1 thmnah 052 


186 


-21 through 91 


-21 through -1


1 through 91


187 


-44 through 26 


-44 through -1


1 through 26


188 


-13 through 79 


-13 through -1


1 through 79


189 


-42 through 165 


-42 through -1


1 through 165


190 


1 through 201 




1 through 201


191 


-37 through 342 


-37 through -1 


1 through 342


192 


1 through 112 




1 through 112


193 


1 through 43 




1 through 43


194 


-16 through 35 


-16 through -1


1 through 35


195 


-18 through 226 


-18 through -1


1 through 226


196 


-34 through 319 


-34 through -1


1 through 319


197 


1 through 30




1 through 30


198 


-48 through 64 


-48 through -1


1 through 64


199 


1 through 54 




1 through 54


200 


-21 through 130 


-21 through -1


1 through 130


201 


-25 through 203


-25 through -1


1 through 203


202 


-47 through 17 


-47 through -1


1 through 17


203 


-31 through 115


-31 through -1


1 through 115


204 


1 thrniKrh 5^7 

1 liJluUgil o / 




1 inrougn o / 


205 


-97 thrmiab 1 


-97 tfirnnali _1 


1 through 13 


206 


1 through 154




1 through 154


207 


1 through 101




1 through 101


208 


— 1111 vju-^ii 'tj't 




1 inrougn 4J4 


209 


-17 through 81


-17 through -1


1 through 81


210 


-29 through 54 


-29 through -1


1 through 54


211 


-23 through 206


-23 through -1


1 through 206


212 


-21 through 131


-21 through -1


1 through 131


213 


-54 through 125 


-54 through -1


1 through 125


214 


-92 through 177 


-92 through -1


1 through 177


215 


-22 through 113


-22 through -1


1 through 113


216 


-38 through 29


-38 through -1


1 through 29


217 


-54 through 71 


-54 through -1


1 through 71


218 


-21 through 355


-21 through -1


1 through 355


219 


-30 through 181


-30 through -1


1 through 181


220 


-60 through 94


-60 through -1


1 through 94


221 


-42 through 81


-42 through -1


1 through 81




222 


-19 through 327 


-19 through -1 


1 through 327


223 


-20 through 190 


-20 through -1 


1 through 190 


224 


-20 through 164 


-20 through -1


1 through 164 


225 


-22 through 205 


-22 through -1 


1 through 205 


226 


-41 through 33 


-41 through -1 


1 through 33 


227 


1 through 73 




1 through 73 


228 


-16 through 66 


-16 through -1 


1 through 66 


229 


-56 through 63 


-56 through -1 


1 through 63


230


1 through 54 




1 through 54 


231 


-14 through 196 


-14 through -1 


1 through 196 


232 


1 through 108 




1 through 108 


233 


-18 through 25 


-18 through -1 


1 through 25 



234 


1 through 36 




1 through 36 


235 


-13 through 294 


-13 through -1 


1 through 294 



236 


-32 through 74 


-32 through -1 


1 through 74 


237 


-19 through 23 


-19 through -1 


1 through 23 



238 


-20 through 97 


-20 through -1 


1 through 97 


239 


-37 through 141 


-37 through -1 


1 through 141 


240 


-27 through 99 


-27 through -1 


1 through 99 


241 


-115 through 59 


-115 through -1 


1 through 59 



378 


-20 through 32 


-20 through -1 


1 through 32


379 


-23 through 170 


-23 through -1


1 through 170


380 


-14 through 68 


-14 through -1 




381 


-21 through 177 


-21 through -1


1 through 177


382 


-55 through 105 


-55 through -1 


1 through 105


383 


-18 through 90


-18 through -1


1 through 90


384 


-22 through 42 


-22 through -1 


1 through 42


385 


-15 through 12


-15 through -1


1 through 12


386 


-21 through 165


-21 through -1


1 through 165


387 


-26 through 153


-26 through -1


1 through 153


388 


-55 through 95 


-55 through -1


1 through 95


389 


-31 through 205


-31 through -1


1 through 205


390 


-100 through 49


-100 through -1


1 through 49


391 


-49 through 20 


-49 through -1 


1 through 20


392 


-30 through 211


-30 through -1


1 through 211


393 


-30 through 17 


-30 through -1


1 through 17


394 


-28 through 37 


-28 through -1


1 through 37


395 


-24 through 49 


-24 through -1 


1 through 49


396 


-18 through 42 


-18 through -1 


1 through 42


397 


-93 through 99 


-93 through -1


1 through 99


398 


-72 through 77 


-72 through -1


1 through 77


399 


-20 through 53 


-20 through -1 


1 through 53


400 


-20 through 66 


-20 through -1


1 through 66


401 


-21 through 57 


-21 through -1


1 through 57


402 


-28 through 37 


-28 through -1


1 through 37


403 


-27 through 184 


-27 through -1


1 through 184


404 


-80 through 43 


-80 through -1 


1 through 43 



405 


-26 through 60 


-26 through -1 


1 through 60 


406 


-31 through 131 


-31 through -1 


1 through 131 


407 


-37 through 61 


-37 through -1 


1 through 61 


408 


-15 through 55 


-15 through -1 


1 through 55 


409 


-45 through 15 


-45 through -1 


1 through 15 


410 


-22 through 17 


-22 through -1 


1 through 17 


411 


-23 through 28 


-23 through -1 


1 through 28


412


-48 through 47 


-48 through -1


1 through 47


413 


-32 through 28 


-32 through -1


1 through 28


414 


-79 through 91


-79 through -1


1 through 91


415 


-82 through 108 


-82 through -1


1 through 108


416 


-60 through 54 


-60 through -1


1 through 54


417 


-108 through 53 


-108 through -1


1 through 53


418 


-21 through 46


-21 through -1


1 through 46


419 


-32 through 300 


-32 through -1


1 through 300


420 


-19 through 46


-19 through -1


1 through 46


422 


-30 through 27 


-30 through -1


1 through 27


423 


-17 through 68 


-17 through -1


1 through 68


424 


-17 through 68 


-17 through -1


1 through 68


425 


-9Q thrniifrh 40 


-90 thmiiffh -1 
iiiiL^u^ii -1 


1 fVir/MirrVi 4ft 

1 inrougn ho 


426 




-^6 fVirrtiirrli _1 

-ju Liirougii -1 


1 inrougn oo 


427 




-ju Liiruugii -1 


1 inrougn 1 1 




-36 through 14


-36 through -1


1 through 14 




-lO lllXUUgli 1 lo 


- 1 o inrougn - 1 


1 inrougn 1 1 o 






-65 through -1 


1 tnrougn izy 




-oy tiirougn. /z 


-oy inrougn -i 


1 tnrougn 72 




AO fVirrtiifrVi 1 70 

-Qy inruugn i /y 


-oy inrougn -i 


1 tnrougn l/y 




-JO UlTOUgJl U 


-36 through -1 


1 tnrougn 13 




-14 tVirr*iicrVi 77 
-IH- UlTOUgJl /Z 


- 1 H inrougn - 1 


1 tnrougn /z 


435


-SS thrnnah S6 
-JO LlllUUgll ou 


-JO inrougn -i 


1 tnrougn oo 




- 1 6 tfirnn all 1 


- 1 0 inrougn - 1 


1 tnrougn iuj 


437 


- 1 A ffi rrrt 1 crli 1 A.f\ 


-lo Lxirougn -i 


1 through 146 


438


-9 ft tlirrviiaVt Oft 


Oft fV>-rr\iirrV» 1 

-zu inrougn -i 


1 tnrougn yyj 


439


-1 J LlirUUgn JO 


- 1 5 through - 1 


1 tnrougn 3o 


440 


-94 tVirrtiioK 7^ 


-Z4 inrougn -i 


1 tnrougn 75 


441 


-9^ tVirniiali 144 

-zj uirougn ih-^ 


-z J inrougn - 1 


1 tnrougn 144 


442


-76 tlirnnati 01 


- / 0 inrougn - 1 


1 tnrougn y i 


443


-1 J LiilUUgli J J 


- 1 J inrougn - 1 


1 tnrougn J J 


444 


-'^'^ thrmicrh 148 

J J LillUU&il JtO 


-J J luruugn -1 


1 inrougn j4o 


445 


-14 throimh 95 


- 1 4 tlirrn 1 crli — 1 
It- lilliJU^l -1 


1 inrougn z j 


446 


J / till yJXX^^l 1 J 


JXl tVirnntrli -1 
-J / llllUUgll -1 


1 inrougn ij 


447 


-96 thrrnicrh 9S 


-96 ffirr\iiaV» _1 


1 inrougn z j 


448 


-'^0 thrmiali 919 

-J V/ nil \J Ugll ^ 1 Zr 


-ju inrougn -i 


1 tnrougn ziz 


449 


-6 ft thmiiaVi 04 


Aft fVtr/Mirrli 1 

-ou inrougn - 1 


i tnrougn y4 


450 


-61 thrrtiiffh 9R 

-U 1 LlllUUgll ZO 


-oi inrougn -i 


1 tnrougn Zo 


451 


-96 throiiph 47 


-96 tbrniiCFli -1 


1 inrougn h / 


452 


-34 through 20 


-34 through -1 


1 through 20 


453 


-38 through 83 


-38 through -1 


1 through 83 


454 


-37 through 129 


-37 through -1 


1 through 129 


455 


-26 through 154 


-26 through -1 


1 through 154 


456 


-64 through 27 


-64 through -1 


1 through 27 


457 


-23 through 234 


-23 through -1 


1 through 234 


458 


-60 through 133 


-60 through -1 


1 through 133


459


-78 throiiph 79 


-78 thrnnah -1 

^O UllUUgll -1 


1 thrnnah 70 
1 llllUUgll /y 


460 




-1'^ thrnnah -1 
"ij LlllUUgll -1 


1 thrnnah lft8 

1 llllUU^ll iUO 


461 


-17 through 27 


-17 through -1


1 through 27


462 


-13 through 96


-13 through -1


1 through 96


463 


-41 through 102


-41 through -1


1 through 102


464 


-30 through 202


-30 through -1


1 through 202


465 


-21 through 40 


-21 through -1


1 through 40


466 


-19 thrniicrh 1 ^ 


- 1 0 thrm 1 ah _ 1 
~ ly Liiiuugii -1 


1 +hT*/MlfTh 1 ^ 

1 inrougn ij 


467 


-'S4 tfirrmah 161 


-^4 thrnnah _1 
-JH llliUUgll -1 


1 thrnncrh 1A1 

1 inrougn loi 


468 


-17 thrnnah 1 ft 

" 1 / nil KJ Ugll 1 V/ 


-17 thrnncrh —1 
"1 / LlllUUgll "1 


1 thrmifrh 1 ft 

1 inrougn lu 


469 


-74 thrniioh 61 


—74 thfrtTifrh _1 
-ZH UUUUgll -1 


1 inrougn o i 


470 


-16 thrniioh '^^ 

"lU LlllUUgii J J 


-lu niruugii "i 


1 inrougn do 


471 




thrnnah —1 
"H J Llll UUgll - 1 


1 thrmicrh 7 A 

i inrougn zh 


472 


-1 S thrniioh 48 


-1 ^ thrnnah -1 

- 1 J Llll U Ugll 1 


1 thrnncrh ASi 

1 inrougn ho 


47^^ 


-^8 thrnnah 171 

"JO 1111 UU^l 1^1 


-^8 thrnno-h _1 
-Do LlllOUgn -1 


1 fhrrtnn-fi 171 

1 inrougn izi 


474 


-71 thrnnah 1^7 
" / 1 Ull UUgll 1 0 / 


- / 1 inrougn - 1 


1 inrougn lo/ 


*4 




-J / inrougn -i 


1 1/11 

1 tnrougn 14 1 


H /O 


-zi inrougn /j 


-21 through -1 


1 tnrougn Id 


477


-24 through 17


-24 through -1


1 through 17


H /O 


-z / inrougn oo 


-z/ inrougn -1 


1 tnrougn oo 




-16 through 232


-16 through -1


1 through 232 




-21 through 130


-21 through -1


1 through 130


Ho 1 


-7^ tVimiicrV* 71 A 

-zj inrougn zih 


-zj inrougn -i 


1 inrougn zi4 




-07 thrnnah ^ 


-yz inrougn -i 


1 inrougn i lo 


TO J 


-'^0 thrnnah Al 


-jy inrougn -i 


1 ■fl^Y'rturrl-i /IT 

1 inrougn 4 / 




-27 through 13


-27 through -1


1 through 13




-16 through 49


-16 through -1


1 through 49




-J J inrougn 


-jd inrougn -i 


1 tnrougn id 


dS7 


-OH- inrougn izd 


-o4 inrougn -i 


1 tnrougn lz5 




-17 through 19


-17 through -1


1 through 19




-29 through 15


-29 through -1


1 through 15 


490 


-S7 thrnnah 1 1 1 

-JZ LlliUUgll 111 


-jz inrougn -i 


1 inrougn 1 1 1 


491 


-47 through 17


-47 through -1


1 through 17


497 


-SO thrnnah 1 ^8 

"JU LlllUUgll lOO 


inrougn -i 


1 inrougn loo 


49'? 


-1 ^ thrnnoh 7ft 1 
-IJ UliUUgll ZV/1 


- 1 -) inrougn - 1 


1 inrougn zu i 


494 


-10 thrnncrh 1 1 ^ 


- 1 y inrougn - 1 


1 inrougn i o 


49 S 


-16 thrnnah ^^0 
-lo imuugii oy 


-lo inrougn -i 


1 inrougn oy 


496 


-70 thrnnah 7 A'? 
"Z" Uxxuugn zoj 


70 thr/Mirrh 1 

-zy inrougn -i 


1 inrougn zoo 


497 

H7 / 


— thfmifTh 
-JU liilUugll UU 


-JO inrougn -i 


1 through 66 


498 


-28 through 31 


-28 through -1 


1 through 31


499 


-13 through 86 


-13 through -1 


1 through 86 


500 


-13 through 86 


-13 through -1 


1 through 86 


501 


-25 through 83


-25 through -1 


1 through 83 


502 


-15 through 168 


-15 through -1 


1 through 168 


503 


-15 through 83 


-15 through -1 


1 through 83 


504 


-57 through 126 


-57 through -1 


1 through 126 
505


-14 through 126 


-14 through -1 


1 through 126 


506 


-14 through 45 


-14 through -1 


1 through 45 


507 


-36 through 65 


-36 through -1 


1 through 65 


508 


-55 through 286 


-55 through -1 


1 through 286 


509 


-42 through 66 


-42 through -1 


1 through 66 


510 


-26 through 54 


-26 through -1 


1 through 54 


511 


-44 through 114 


-44 through -1 


1 through 114 


512 


-28 through 102 


-28 through -1 


1 through 102 


513 


-62 through 137 


-62 through -1 


1 through 137 


514 


-25 through 155 


-25 through -1 


1 through 155 



Id     Collection Refs      Deposit Name

40     ATCC # 98921         SignalTag 121-144
41     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
42     ATCC # 98921         SignalTag 121-144
43     ATCC # 98920         SignalTag 67-90
44     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
45     ATCC # 98920         SignalTag 67-90
46     ATCC # 98923         SignalTag 44-66
47     ATCC # 98920         SignalTag 67-90
48     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
49     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
50     ATCC # 98921         SignalTag 121-144
51     ATCC # 98921         SignalTag 121-144
52     ATCC # 98920         SignalTag 67-90
53     ATCC # 98923         SignalTag 44-66
54     ATCC # 98920         SignalTag 67-90
55     ATCC # 98920         SignalTag 67-90
56     ATCC # 98920         SignalTag 67-90
57     ATCC # 98921         SignalTag 121-144
58     ATCC # 98920         SignalTag 67-90
59     ATCC # 98920         SignalTag 67-90
60     ATCC # 98920         SignalTag 67-90
61     ATCC # 98923         SignalTag 44-66
62     ATCC # 98923         SignalTag 44-66
63     ATCC # 98923         SignalTag 44-66
64     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
65     ATCC # 98923         SignalTag 44-66
66     ATCC # 98921         SignalTag 121-144
67     ATCC # 98920         SignalTag 67-90
68     ATCC # 98920         SignalTag 67-90
69     ATCC # 98921         SignalTag 121-144
70     ATCC # 98921         SignalTag 121-144
71     ATCC # 98921         SignalTag 121-144
72     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
73     ATCC # 98923         SignalTag 44-66
74     ATCC # 98923         SignalTag 44-66
75     ATCC # 98920         SignalTag 67-90
76     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
77     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
78     ATCC # 98921
79     ATCC # 9892[?]
80     ATCC # 98922
81     ATCC # 98921         SignalTag 121-144
82     ATCC # 98920         SignalTag 67-90
83     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
84     ATCC # 98923         SignalTag 44-66
85     ATCC # 98923         SignalTag 44-66
86     ATCC #               SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
87     ATCC # 98923         SignalTag 44-66
88     ATCC # 98923         SignalTag 44-66
89     ATCC # 98923         SignalTag 44-66
90     ATCC # 98923         SignalTag 44-66
91     ATCC # 98923         SignalTag 44-66
92     ATCC # 98920         SignalTag 67-90
93     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
94     ATCC # 98923         SignalTag 44-66
95     ATCC # 98923         SignalTag 44-66
96     ATCC # 98920         SignalTag 67-90
97     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
98     ATCC # 98921         SignalTag 121-144
99     ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
100    ATCC # 98921         SignalTag 121-144
101    ATCC # 98920
102    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
103    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
104    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
105    ATCC # 98921         SignalTag 121-144
106    ATCC # 98920         SignalTag 67-90
107    ATCC # 98920         SignalTag 67-90
108    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
109    ATCC # 98923         SignalTag 44-66
110    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
111    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
112    ATCC # 98920         SignalTag 67-90
113    ATCC # 98920         SignalTag 67-90
114    ATCC # 98923         SignalTag 44-66
115    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
116    ATCC # 98920         SignalTag 67-90
117    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
118    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
119    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
120    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
121    ATCC # 98923         SignalTag 44-66
122    ATCC # 98920         SignalTag 67-90
123    ATCC # 98920         SignalTag 67-90
124    ATCC # 98922         SignalTag 91-94, 96, 97, 99-107, 109-112 et 114-120
125    ECACC # 98121506     SignalTag 11121998
126    ECACC # 98121506     SignalTag 11121998
127    ECACC # 98121506     SignalTag 11121998
128    ECACC # 98121506     SignalTag 11121998
129    ECACC # 98121506     SignalTag 11121998
130    ECACC # 98121506     SignalTag 11121998
131    ECACC # 98121506     SignalTag 11121998
132    ECACC # 98121506     SignalTag 11121998
133    ECACC # 98121506     SignalTag 11121998
134    ECACC # 98121506     SignalTag 11121998
135    ECACC # 98121506     SignalTag 11121998
136    ECACC # 98121506     SignalTag 11121998
137    ECACC # 98121506     SignalTag 11121998
138    ECACC # 98121506     SignalTag 11121998
139    ECACC # 98121506     SignalTag 11121998
140    ECACC # 98121506     SignalTag 11121998
INTERNAL DESIGNATION NUMBER


SEQ ID NO.


TYPE OF SEQUENCE


20-5-2-C3-CL0 4 


40 


DNA 


20-8-4-A11-CL2 6 


41 


DNA 


21-1-4-F2-CL11 1 


42 


DNA 


22-11-2-H9-CL1 1 


43 


DNA 


25-7-3-D4-CL0 2 


44 


DNA 


26-27-3-D7-CL0 1 


45 


DNA 


26-35-4-H9-CL1 1 


46 


DNA 


26-45-2-C4-CL2 6 


47 


DNA 


27-1-2-B3-CL0 1 


48 


DNA 


27-1-2-B3-CL0 2 


49 


DNA 


27-19-3-G7-CL11 2 


50 


DNA 


33-10-4-E2-CL13 4 


51 


DNA 


33-10-4-H2-CL2 2 


52 


DNA 


33-110-4-A5-CL1_1


53 


DNA 


33-13-1-C1-CL1_1


54 


DNA 


33-30-2-A6-CL0 1 


55 


DNA 


33-35-4-F4-CL1 2 


56 


DNA 


33-35-4-G1-CL1_2


57 


DNA 


33-36-3-E2-CL1 1 


58 


DNA 


33-36-3-E2-CL1 2 


59 


DNA 


33-36-3-F2-CL2 2 


60 


DNA 


33-4-2-G5-CL2 1 


61 


DNA 


33-49-1-H4-CL1 1 


62 


DNA 


33-66-2-B10-CL4 1 


63 


DNA 


33-97-4-G8-CL2 2 


64 


DNA 


33-98-4-C1-CL1_3


65 


DNA 


47-14-1-C3-CL0 5 


66 


DNA 


47-15-1-E11-CL0_1


67 


DNA 


47-15-1-H8-CL0 2 


68 


DNA 
69


DNA


48-1-1-H7-CL0_4


70


DNA


48-1-1-H7-CL0_5


71


DNA


48-3-1-H9-CL0_6


72


DNA


48-54-1-G9-CL2_1


73


DNA


48-54-1-G9-CL3_1


74


DNA


48-7-4-H2-CL2_2


75


DNA


51-11-3-D5-CL1_3


76


DNA


51-11-3-G9-CL0_1


77


DNA


51-15-4-A12-CL11_3


78


DNA


51-17-4-A4-CL3_1


79


DNA


51-2-3-F10-CL1_5


80


DNA


51-2-4-F5-CL11_2


81


DNA


51-27-4-F2-CL0_2


82


DNA


51-34-3-F8-CL0_2


83


DNA


57-1-4-E2-CL1_2


84


DNA


57-19-2-G8-CL2_1


85


DNA


57-27-3-G10-CL2_2


86


DNA


58-33-3-B4-CL1_2


87


DNA


58-34-3-C9-CL1_2


88


DNA


58-4-4-G2-CL2_1


89


DNA


58-48-1-G3-CL2_4


90


DNA


5R_6-1_R4-PT 1 1 

JO y— i ITA.'^-V^l^ 1 1 




■RXT A 

-UNA 


60-1 2-1 -F1 l-CT 1 9 

UV/ A Aj 1 1 ^1-/ 1 ^ 


09 

!7Z 




65-4-4-H1-rT 1 1 


O'? 


UNA 


74_5-l-F4-rT 1 9 




r\XT A 
UNA 


76-13-3-A9-CL1_2


95 


DNA 


76-16-1-D6-CL1 1 


96 


DNA 


76-28-3-A12-CL1 5 


97 


DNA 


76-42-2-F3-CL0 1 


98 


DNA 


77-16-4-G3-CL1 3 


99 


DNA 
77-39-4-H4-CL11_4


100


DNA


78-24-3-H4-CL2_1


101


DNA


78-27-3-D1-CL1_6


102


DNA


78-28-3-D2-CL0_2


103


DNA


78-7-1-G5-CL2_6


104


DNA


84-3-1-G10-CL11_6


105


DNA


58-48-4-E2-CL0_1


106


DNA


23-12-2-G6-CL1_2


107


DNA


25-8-4-B12-CL0_5


108


DNA


26-44-3-C5-CL2_1


109


DNA


27-1-2-B3-CL0_3


110


DNA


30-12-3-G5-CL0_1


111


DNA


33-106-2-F10-CL1_3


112


DNA


33-28-4-D1-CL0_1


113


DNA


33-31-3-C8-CL2_1


114


DNA


48-24-1-D2-CL3_2


115


DNA


48-46-4-A11-CL1_4


116


DNA


51-1-4-C1-CL0_2


117


DNA


< 1 1 Q 1 UO /^T 1 O 


1 1 O 

1 lo 


r\XT A 

UNA 


< 1 /I O 1 CO PT 1 1 

D 1 -4z- J-r y-L^ij 1 _ 1 


1 1 Q 

1 ly 


T^XT A 

UNA 


^1 ^ P9 PT (\ A 


1 9n 
izu 


r\XT A 
UJNA 


^7184 T4^ PT 9 1 


191 
IZl 




9/^ 91 1 PQ PT 1 1 


1 99 
IZZ 


T^XT A 
UJNA 


7/^ 91 1 PQ PT 1 1 


1 91 
1Z3 


T^XT A 
UiNA 


7R 8 ^ FA PT n 1 


1 9A 
1Z4 


T^XT A 
UJNA 


IQ 1A 1 CO PT 1 '? 


19^ 
IZj 


T^XT A 
UJNA 


33-11-1-B11-CL1_2


126 


DNA 


33-113-2-B8-CL1_2


127 


DNA 


33-19-1-C11-CL1_1


128 


DNA 


33-61-2-F6-CL0_2 


129 


DNA 


47-4-4-C6-CL2_2


130 


DNA 
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T^O «JT^ X VJ ^ WJL/ J. X 


131 


DNA 


51-43-3-G3-CL0 1 


132 


DNA 


55-1-3-D11-CL0_1


133


DNA 




1 34 






1 J J 




/o-io-i-ro-v^i-d 1 


1 jO 




7^-9^_^_nS-rT 9 9 


1 J / 






LJO 




7R_91_3_G7-rT 9 1 


no 




JO H-J-T— J_>1 l-\^J-/ U Z 






90 A 1 ni 1 FT 9 


949 
Z4z 




9n SI A All FT 9 
zU-o-4-A 1 1 -r JLz 


Z4j 


UiNA 


zz-o-z-Ul-rLz 


Z44 




zz- 1 1 -z-riy-r h 1 


Oyl ^ 

z4j 


■r\xT A 
UNA 


Z J -o- J-iD 1 -r L( 1 


04A 
Z40 


T^XT A 
JJJNA 


94 "5 '5 PA FT 1 


0/17 
Z4 / 


JJJNA 


OA A 1 T-T^ FT 1 


0/1 C 
z4o 


T^XT A 
JJJNA 


OA /I ^ O HA FT 0 
zo-4 j-z-v^4-r Lz 


z4y 


r\XT A 


zo-4o- 1 -rl 1 U-rL 1 


ocn 
zjU 


■r\XT A 
UNA 


OA AQ ^ A ^ TTT O 

zO-4y-l-AD-rL.z 


o< 1 

ZJ 1 


■p\XT A 

UNA 


'^n A J. F'? FT 


0^9 
Z jZ 


T^M A 
UiNA 


';^_A-i-ni 1 -FT 1 


Zj J 


JJINA 


^^-R-l -A'^-FT 9 


9SA 

Z J*+ 






9SS 

Z J J 




7^ lA 4 FT FT 1 


9^A 
zjO 


UJNA 


'2'5 01 9 Fi^ FT t 
J J -z 1 -z- JJO -r Lf 1 


9^7 

ZD / 


FiXT A 
UJNA 


33-26-4-E10-FL1


258




33-27-1-E11-FL1


259 


DNA 


33-28-4-D1-FL1


260 


DNA 


33-28-4-E2-FL2 


261 


DNA 


33-30-4-C4-FL1 


262 


DNA 
33-35-4-F4-FL1


263 


DNA 


33-36-3-F2-FL2 


264 


DNA 


33-52-4-F9-FL2 


265 


DNA 


33-52-4-H3-FL1 


266 


DNA 


33-59-1-B7-FL1 


267 


DNA 


33-71-1-A8-FL1 


268 


DNA 


33-72-2-R7-FT 1 








770 






771 


DMA 


33-110-2-G4-FL1


272


DNA 


47-7-4-D2-FL2


273


DNA 


47-10-2-G12-FL1




DNA 




77^ 






77A 
Z /O 


iJ JN A 




777 


"nXT A 


47-1 8 d-F'? FT 9 


77R 


FMsJ A 


4S-';-l-T40-FT ^ 


77Q 


rVMA 
JLyiNA 


4R-4_7-T4^-FT 1 


780 
zou 




4S_A-l-rO-FT 1 


781 

Zo 1 




4S-7-4-T-T9-FT 7 


787 
ZoZ 


FiXT A 




78'i 

ZOJ 


F*>JA 
i-/lNA 




784 




48-1Q-3-A7-FT 1 


78S 




48-lQ-^_G1-FT 1 






48-25-4-D8-FL1 


287 


DNA 


48-21-4-H4-FL1


288


DNA 


48-26-3-B8-FL2 


289 


DNA 


48-29-1-E2-FL1 


290 


DNA 


48-31-3-F7-FL1 


291 


DNA 


48-47-3-A5-FL1 


292 


DNA 


51-1-1-G12-FL1 


293 


DNA 
294


DNA 


51-1-4-F9-FT 2 

<J 1. 1 ^ J— A A_/^ 


295 


DNA 


S1-2-1-F10-FT 1 






51-7-^-FlO-FT 1 

J 1 J 1 l.\J r X 


997 




51-9-4-F5-FT 1 


798 
Z"o 




Sl-'^-^-RlO-FT ? 

J 1 J J JJ 1 V J. 


799 




SI-^-'^-Flin-FT 1 






S1_7-^-r;^-FT 1 

J 1" /~J~Vjf j-F JU 1 






Sl-IO-'^-DI 1-FT 1 


juz 




J 1- 1 i-j-uj-r Lt I 


'in'? 






jU4 




^ 1 1 ^ yl TJ 1 n FT 1 


'SAC 


r\XT A 
UJNA 


CI 17/1 A/1 PT1 




FiXT A 


^ 1 - 1 o- 1 -\^J -r JL 1 


JU / 


"nXT A 
UINA 


CI OC '2 XTT 1 




FvXT A 

JDJNA 


CI 97 1 po PI 1 
J 1 -Z / - 1 -Jtio-r L( 1 




rMVT A 

UJNA 


CI 9 PI 7 




T^XT A 

UJNA 


CI '20 '7 UrO PT 1 


'2 11 

ill 


r\XT A 

UNA 


C 1 AO 1 PQ PT 1 


'2 1 O 
J IZ 


"T^XT A 

UJNA 


CI A A A V(A PT 1 


J ID 


T^XT A 
UJNA 




'XAA 


T^XT A 
UJNA 




J IJ 






J 1 D 


UlN A 




^ 1 7 


T^XTA 
UiNA 


S7-97-^-A 1 1 PT 1 
J /-z 1 1-rLri 


'? 1 ft 
i lo 


T^XT A 

UJNA 


S7-97-'^-nin-FT 0 




UiNA 


58-10-3-D12-FL1 


320 


DNA 


58-11-1-G10-FL1


321 


DNA 


58-11-2-G8-FL2 


322 


DNA 


58-36-3-A9-FL2 


323 


DNA 


58-38-1-A2-FL2 


324 


DNA 
58-38-1-E5-FL1


325 


DNA 


58-44-2-B3-FL3 


326 


DNA 


58-45-3-H11-FL1


327 


DNA 


58-53-2-B12-FL2 


328 


DNA 


59-9-4-A10-FL1


329 


DNA 


60-16-3-A6-FL1


330 


DNA 


60-17-3-G8-FL2 


331 


DNA 


62-5-4-B10-FL1


332 


DNA 


65-4-4-H3-FL1 


333 


DNA 


74-3-1-B9-FL1 


334 


DNA 


76-4-1-G5-FL1 


335 


DNA 


76-7-3-A12-FL1 


336 


DNA 


76-16-4-C9-FL3 






76-30-3-B7-FL1






77-5-1-C2-FL1 






77-5-4-E7-FL1 


340 


DNA 


77-11-1-A3-FL1


341 


DNA 


77-16-3-D7-FL1


342 


DNA 


77-16-4-G3-FL1 


343


DNA 


77-25-1-A6-FL1


344 


DNA 


77-26-2-F2-FL3 


345


DNA 


78-6-2-E3-FL2 


346 


DNA 


78-7-1-G5-FL2 


347 


DNA 


78-16-2-C2-FL1 


348 


DNA 


78-18-3-B4-FL3 


349 


DNA 


78-20-1-G11-FL1


350 


DNA 


78-22-3-E10-FL1


351 


DNA 


78-24-2-B8-FL1 


352 


DNA 


78-24-3-A8-FL1 


353 


DNA 


78-24-3-H4-FL2 


354 


DNA 


78-25-1-F11-FL1


355 


DNA 
78-26-1-B5-FL1


356 


DNA 


78-27-3-D1-FL1


357 


DNA 


78-29-1-B2-FL1


358




78-29-4-B6-FL1 


359 


DNA 


14-1-3_B6-FL1 


360 


DNA 


30-9-1-G8-FL2 


361 


DNA 


33-10-4-H2-FL2 


362 


DNA 


33-10-4-H2-FL1




DNA 


74-10-3-C9-FL2




r»JA 


'5'5-Q7-4-n8-FT 


J VJ 


n>JA 


^^-07-4-G8-FT 2 
vjo J. JL/^ 














J Do 




47-^7_4-Gl 1-FT 1 




nXTA 


S7-7S-1-Fin-FT 7 














'579 


JJINA 




J / J 




76-21-1-C4-FL1






78-26-2-H7-FL1


375




77-20-2-E11-FL1


376




47-1-3-F7-FL2


377


DNA


20-5-2-C3-CL0_4


141


PRT


20-8-4-A11-CL2_6


142


PRT


21-1-4-F2-CL11_1


143


PRT


22-11-2-H9-CL1_1


144


PRT


25-7-3-D4-CL0_2 


145 


PRT 


26-27-3-D7-CL0_1


146 


PRT 


26-35-4-H9-CL1_1


147 


PRT 


26-45-2-C4-CL2_6


148 


PRT 


27-1-2-B3-CL0_1


149 


PRT 
27-1-2-B3-CL0_2


150 


PRT 


27-19-3-G7-CL11 2 


151 


PRT 


33-10-4-E2-CL13_4


152


PRT


33-10-4-H2-CL2_2


153


PRT


33-110-4-A5-CL1_1


154


PRT


33-13-1-C1-CL1_1


155


PRT


33-30-2-A6-CL0_1


156


PRT


33-35-4-F4-CL1_2


157


PRT


33-35-4-G1-CL1_2


158


PRT


33-36-3-E2-CL1_1


159


PRT


33-36-3-E2-CL1_2


160


PRT


33-36-3-F2-CL2_2


161


PRT


33-4-2-G5-CL2_1


162


PRT


33-49-1-H4-CL1_1


163


PRT


33-66-2-B10-CL4_1


164


PRT


33-97-4-G8-CL2_2


165


PRT


33-98-4-C1-CL1_3


166


PRT


47-14-1-C3-CL0_5


167


PRT


47-15-1-E11-CL0_1


168


PRT


47-15-1-H8-CL0_2


169


PRT


170


PRT


48-1-1-H7-CL0_4


171


PRT


48-1-1-H7-CL0_5


172


PRT


48-3-1-H9-CL0_6


173


PRT


48-54-1-G9-CL2_1


174


PRT


48-54-1-G9-CL3_1


175


PRT


48-7-4-H2-CL2 2 


176 


PRT 


51-11-3-D5-CL1_3


177 


PRT 


51-11-3-G9-CL0_1


178 


PRT 


51-15-4-A12-CL11 3 


179 


PRT 


51-17-4-A4-CL3_1


180 


PRT 






CONT. TABLE VII 



51-2-3-F10-CL1_5


181 




51-2-4-F5-CL11 2 


182 


PRT 


51-27-4-F2-CL0 2 


183 


PRT 


51-34-3-F8-CL0_2


184 


PRT 


57-1-4-E2-CL1 2 


185 


PRT 


57-19-2-G8-CL2 1 


186 


PRT 


57-27-3-G10-CL2 2 


187 


PRT 


58-33-3-B4-CL1 2 


188




58-34-3-C9-CL1 2 


189 


PRT


58-4-4-G2-CL2 1 


190 


PRT 


58-48-1-G3-CL2_4


191


PRT 


58-6-1 -H4-rT 1 1 




PRT 


60-1 7-1 -F1 l-PT 1 9 




PPT 


6 5-4-4-14^ -PT 1 1 


1 OA 






1 QS 

1 "J 


PTJT 


76-13-3-A9-CL1_2


196


PRT


76-16-1-D6-CL1_1


197


PRT


76-28-3-A12-CL1_5


198


PRT


76-42-2-F3-CL0_1


199


PRT


77-16-4-G3-CL1_3


200


PRT


77-39-4-H4-CL11_4


201


PRT


78-24-3-H4-CL2_1


202


PRT


78-27-3-D1-CL1_6


203


PRT


78-28-3-D2-CL0_2


204


PRT


78-7-1-G5-CL2_6


205


PRT


84-3-1-G10-CL11_6


206 


PRT 


58-48-4-E2-CL0_1


207 


PRT 


23-12-2-G6-CL1_2


208 


PRT 


25-8-4-B12-CL0 5 


209 


PRT 


26-44-3-C5-CL2_1


210 


PRT 


27-1-2-B3-CL0 3 


211 


PRT 
30-12-3-G5-CL0_1


212 


PRT 


33-106-2-F10-CL1_3


213


PRT


33-28-4-D1-CL0_1


214


PRT


33-31-3-C8-CL2 1 


215 


PRT 


48-24-1-D2-CL3_2


216


PRT 


48-46-4-A11-CL1_4


217 


PRT 


51-1-4-C1-CL0_2


218 


PRT 




91Q 




51-42-l-F9-rT 1 1 


990 


PRT 




991 


PPT 






DDT 




99'5 


DDT 




99A 


DDT 




99^ 
ZZ J 


DDT 


iQ_in-i-r9-rT i % 


99/^ 
zzo 


DPT 


'^^-l 1-1 -Rl l-PT 1 9 


997 
zz / 


PPT 




99ft 
ZZo 


DDT 




990 

zzy 


DDT 
rKl 


^^-^1-9-P6-PT 0 9 


zju 


DDT 
rKl 


47-4-4-pA_PT 9 9 


9^? 1 
Z J 1 


DPT 


48-54- 1-G9-CT 1 1 


9^9 


PPT 


51-43-3-G3-CL0 1 


233 


PRT 
r ivx 


55-1 -3-D 1 1 -CI 0 1 


9^4 


PPT 
r JV 1 


5R_14-9-m-PT 1 9 


9^S 

Zj J 


PPT 
rKl 


58-^5-9-R6-PT 9 ^ 


Z_>D 


PPT 
rKl 


76-lS-1-Ffi-PT 1 1 


9'^7 
Z J / 


DPT 
rKl 


76-23-3-G8-CL2_2


238 


PRT 


76-30-3-B7-CL1_1


239 


PRT 


78-21-3-G7-CL2_1


240 


PRT 


58-45-4-B11-CL13_2


241 


PRT 


20-6-1-D11-FL2 


378 


PRT 









1f\ Q A All TTT 0 


51y 


PKT 


ZZ-O-Z-C 1 -r L,Z 


3oU 


FRT 


zz- 1 1 -z-riy-r JL 1 


'2 01 

3ol 




Z J -o- J -L> 1 -r Lf 1 


'2 09 
JOZ 


DOT 


Z4- J- J -OO-r L, 1 


'20'2 


rKF 


0/1 /I 1 TJ^ T?T 1 


384 


FRT 


Zo-4 j-z-C4-r 


T OC 

385 


FRT 


m AO 1 T T 1 A T?T 1 


386 


FRT 


zo-4y- 1 -AD-r Lz 


387 


FRT 


JU-O-4-bi-rLi 


"2 00 

388 


FRT 


'2 1 ^ 1 P 1 1 T7T 1 


389 


FRT 


33-6-1-A3-FL2


390


PRT


33-11-3-C6-FL1


391


PRT


33-14-4-E1-FL1


392


PRT


33-21-2-D5-FL1


393


PRT


33-26-4-E10-FL1


394


PRT


33-27-1-E11-FL1


395


PRT


33-28-4-D1-FL1


396


PRT


33-28-4-E2-FL2


397


PRT


33-30-4-C4-FL1


398


PRT


33-35-4-F4-FL1


399


PRT


33-36-3-F2-FL2


400


PRT


33-52-4-F9-FL2


401


PRT


33-52-4-H3-FL1


402


PRT


33-59-1-B7-FL1


403


PRT


33-71-1-A8-FL1


404


PRT






DOT 


33-105-2-C3-FL1 


406 


PRT 


33-107-4-C3-FL1 


407 


PRT 


33-110-2-G4-FL1 


408 


PRT 


47-7-4-D2-FL2 


409 


PRT 
47-10-2-G12-FL1


410


PRT




41 1 


PT?T 

Jrlvl 




417 


X Jvl 




413 


PT?T 


47-18-4-E3-FL2 


414 


PT?T 


48-3-1 -H9-FI 3 


41 ^ 


PT^T 

± Iv 1 


4R_4_2-H3-FT 1 


H ID 


PPT 




417 


PPT 


4R-7-4-149-FT 7 


41 8 


PPT 




410 


rKl 


4R-1 '^-'^-RS-FT 1 


d7n 


rKl 




4Z1 


rKl 


48-1 0-'l m FT 1 


4ZZ 


rKl 




4z3 


rKl 


48 714 T44 FT 1 


4z4 


rKl 


48 7^ 'X 1^8 FT 7 


4z!) 


rKl 


48 70 1 F7 FT 1 
*f o-Z V- 1 -HZ-r i-( 1 


4z0 


rKl 


48 '^1 '? F7 FT 1 


yl07 
4z / 


rKl 


48-47-'^ - A ^-FT 1 


A78 
4Zo 


rKl 


^1_l_l_ni7-FT 1 
Jill vj 1 


47Q 

4zy 


rKl 


51-1-4-FQ_FT 3 




PPT 
rKl 


^1-1 _4-FQ-FT 7 


4^1 
4 J 1 


PPT 
rKl 


^1-7-1 -Fin FT 1 
J 1 -z- 1 -li 1 u-r ly 1 


4jZ 


rKl 


^1 -7-'l-Fl O-FT 1 


4jj 


rKl 


Sl-7-4_FS-FT 1 


4j4 


rKl 


S1-3-3-R10-FT 9 


4^^ 


PPT 
rKl 


51-3-3-B10-FL3 


436 


PRT 


51-7-3-G3-FL1 


437 


PRT 


51-10-3-D11-FL1


438 


PRT 


51-11-3-D5-FL1 


439 


PRT 


51-13-1-F7-FL3 


440 


PRT 
441


PRT


51_17-4-A4-FT 1 


449 


FlN.l 






PPT 
Flvl 






PPT 
fKI 




443 


DDT 

rKl 


^1 ?R 9 m FT 9 
J 1 -Zo-z-vj 1 -r JLZ 


440 


DDT 

rKi 




/1/17 
44 / 


DDX 

rKl 


S 1 -49-';-FQ-FT 1 


446 


DPT 
rKl 




44y 


DDT 
rKl 




43U 


DDT* 

rKl 




/I C 1 

431 


DDT 

rKi 


!) o-zo-o -U 1 -r L 1 


yi C^ 

452 


PRT 


J /-lo-l-JL)j-rLl 


453 


PRT 


;> /-z /-J-Al l-rL,l 


/I C yi 

454 


PRT 


^7 97 r;i A FT 9 
3 /-Z /-j-LrlU-rLrZ 


/ICC 

433 


PRT 


^0 in '5 ni9 FT 1 

J o- 1 -iJ 1 Z-r Lr 1 


/I c/; 
430 


PKl 


coil 1 /^l n FT 1 
36-1 l-i-01U-rI>l 


43/ 


PRT 


CO 11 ^ r^Q ITT O 


/ICO 

458 


PRT 


'2 A '2 A Q FT O 


A cn 
459 


PRT 


CO ^o 1 AO FT 9 
3 O O - 1 - AZ-r L/Z 


4oU 


PRT 




401 


DDT 

rKl 


SS-44-9-T^^-FT ^ 


4A9 
40Z 


DDT 
rKl 




403 


DDT 

rKX 


SR-S'^-9-T^ 1 9 FT 9 
J o~j J ~z~ij 1 z-r JL/Z 


404 


DDT 

rKi 


CO 0 yt A 1 A PT 1 


403 


T»"r> T 

PRT 


/^n 1A '5 FT 1 
OU- 1 0-3-AO-r L> 1 


400 


TIT* T 

PRT 


60-17-3-G8-FL2 


467 


PRT


62-5-4-B10-FL1


468 


PRT 


65-4-4-H3-FL1 


469 


PRT 


74-3-1-B9-FL1 


470 


PRT 


76-4-1-G5-FL1 


471 


PRT 
76-7-3-A12-FL1


472


PRT


76-16-4-C9-FL3


473


PRT


76-30-3-B7-FL1 


474 


PRT


77-5-1-C2-FL1 


475 




77-5-4-E7-FL1


47fi 


PPT 


77-11-1-A3-FL1 


477 


PPT 


77_16-^-D7-FT 1 


47 R 


PPT 


77-1 ^-4.-^'? -FT 1 




rKl 


77-9S-l-Afi-FT 1 




DPT 


77-76-9-F7-FT ^ 


H-O 1 


PPT 




4R7 


DDT 


751-7-1 -H^ FT 7 




DDT 


7R-1/1-7-P7 FT 1 




rKl 


/ 0- 1 0- J -r>H— r Lf J 


45J 


PRT 


/o-ZU-l-Lrl l-rL,l 


486 


PRT 


7Q 77 Q 1 A TTT 1 
/ o-zZ- j-Jd 1 U-r Lf 1 


45/ 


PRT 


75? O/i o TDQ pT 1 
/ o-ZH-Z-r3o-rLf 1 


4oo 


Tin 

PRT 


70 9/1 '2 AS FT 1 


4dy 


PRl 


7S-74-^-14J.-FT 7 




DDT 
PKl 


78-7^-1 -Fl 1-FT 1 


401 
4!? 1 


DDT 

rKl 


78-26-1 -R5-FT 1 


d07 
4yz 


DDT 
rKl 


7S-77-'?-ri1 FT 1 
/ o-z / o~u 1 -r 1 


4yj 


rKl 


78-70-1 -T^7 -FT 1 


4^4 


DDT 

rKl 


78-79-4-R6-FT 1 


40^ 

4yj 


DDT 

rKl 


14-1-3-Ffi-FT 1 


40^ 


DDT 

rKl 


'50-Q-1-G8-FT 7 


407 
4? / 


DDT 

rKl 


33-10-4-H2-FL2 


498 


PRT 


33-10-4-H2-FL1 


499 


PRT 


74-10-3-C9-FL2 


500 


PRT 


33-97-4-G8-FL3 


501 


PRT 


33-97-4-G8-FL2 


502 


PRT 
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J J- 1 UT'~H-jrL'T"r Lt 1 


DKfj 


DDT 


**T / ^ ^ J_> J JT JL/ 1 




DDT 


47-'?7-A-ni 1 FT 1 




DDT 


^7-9^-1 Fin FT 9 


DUO 


DDT 


10 Til FT 1 




DDT" 




jUo 


DDT 


58-48-4-E2-FL2 


509 


J: IV 1 


76-21-1-C4-FL1 


510 


PRT 


78-26-2-H7-FL1 


511 


PRT 


77-20-2-E11-FL1


512 


PRT 


47-1-3-F7-FL2


513 


PRT 
TABLE VIII

ID     Locations   PROSITE Signature Name

195    110-121     Aldehyde dehydrogenases cysteine active site
221    28-37       ATP synthase alpha and beta subunits signature
223    171-181     Regulator of chromosome condensation (RCC1) signature 2
225    90-112      Phosphatidylethanolamine-binding protein family signature
226    10-34       Protein kinases ATP-binding region signature


WHAT IS CLAIMED IS:

1. A purified or isolated nucleic acid comprising the sequence of one of 
SEQ ID NOs: 40-140 and 242-377 or a sequence complementary thereto. 

2. A purified or isolated nucleic acid comprising at least 10 consecutive
bases of the sequence of one of SEQ ID NOs: 40-140 and 242-377 or one of the
sequences complementary thereto.

3. A purified or isolated nucleic acid comprising the full coding sequences
of one of SEQ ID NOs: 40, 42-44, 46, 48, 49, 51, 53, 60, 62-72, 76-78, 80-83, 85-88,
90, 93, 94, 97, 99-102, 104, 107-125, 127, 132, 135-138, 140 and 242-377 wherein the
full coding sequence comprises the sequence encoding signal peptide and the sequence
encoding mature protein.

4. A purified or isolated nucleic acid comprising the nucleotides of one of
SEQ ID NOs: 40-44, 46, 48, 49, 51-53, 55, 56, 58-72, 75-78, 80-88, 90, 93, 94, 97,
99-125, 127, 132, 133, 135-138, 140, and 242-377 which encode a mature protein.

5. A purified or isolated nucleic acid comprising the nucleotides of one of
SEQ ID NOs: 40, 42-46, 48, 49, 51, 53, 57, 60, 62-73, 76-78, 80-83, 85-88, 90, 93-95,
97, 99-102, 104, 107-125, 127, 128, 130, 132, 134-140 and 242-377 which encode the
signal peptide.

6. A purified or isolated nucleic acid encoding a polypeptide having the
sequence of one of the sequences of SEQ ID NOs: 141-241 and 378-513.

7. A purified or isolated nucleic acid encoding a polypeptide having the
sequence of a mature protein included in one of the sequences of SEQ ID NOs:
141-145, 147, 149, 150, 152-154, 156, 157, 159-172, 176-179, 181-189, 191, 194, 195, 198,
200-226, 228, 233, 234, 236-239, 241 and 378-513.

8. A purified or isolated nucleic acid encoding a polypeptide having the
sequence of a signal peptide included in one of the sequences of SEQ ID NOs: 141,
143-147, 149, 150, 152, 154, 158, 161, 163-174, 177-179, 181-184, 186-189, 191,
194-196, 198, 200-203, 205, 208-226, 228, 229, 231, 233, 235-241, and 378-513.
9. A purified or isolated protein comprising the sequence of one of SEQ ID
NOs: 141-241 and 378-513.

10. A purified or isolated polypeptide comprising at least 10 consecutive
amino acids of one of the sequences of SEQ ID NOs: 141-241 and 378-513.

11. An isolated or purified polypeptide comprising a signal peptide of one of
the polypeptides of SEQ ID NOs: 141, 143-147, 149, 150, 152, 154, 158, 161, 163-174,
177-179, 181-184, 186-189, 191, 194-196, 198, 200-203, 205, 208-226, 228, 229, 231,
233, 235-241, and 378-513.

12. An isolated or purified polypeptide comprising a mature protein of one
of the polypeptides of SEQ ID NOs: 141-145, 147, 149, 150, 152-154, 156, 157,
159-172, 176-179, 181-189, 191, 194, 195, 198, 200-226, 228, 233, 234, 236-239, 241 and
378-513.

13. A method of making a protein comprising one of the sequences of SEQ
ID NOs: 141-241 and 378-513, comprising the steps of:

obtaining a cDNA comprising one of the sequences of SEQ
ID NOs: 40-140 and 242-377;

inserting said cDNA in an expression vector such that said cDNA is
operably linked to a promoter; and

introducing said expression vector into a host cell whereby said host cell
produces the protein encoded by said cDNA.

14. The method of Claim 13, further comprising the step of isolating said
protein.

15. A protein obtainable by the method of Claim 14.

16. A host cell containing a recombinant nucleic acid of Claim 1.

17. A purified or isolated antibody capable of specifically binding to a
protein having the sequence of one of SEQ ID NOs: 141-241 and 378-513.

18. In an array of polynucleotides of at least 15 nucleotides in length, the
improvement comprising inclusion in said array of at least one of the sequences of SEQ
ID NOs: 40-140 and 242-377, or one of the sequences complementary to the sequences
of SEQ ID NOs: 40-140 and 242-377, or a fragment thereof of at least 15 consecutive
nucleotides.

19. A purified or isolated nucleic acid of at least 15 bases capable of
hybridizing under stringent conditions to the sequence of one of SEQ ID NOs: 40-140 and
242-377 or a sequence complementary to one of the sequences of SEQ ID NOs: 40-140
and 242-377.

20. A purified or isolated antibody capable of binding to a polypeptide
comprising at least 10 consecutive amino acids of the sequence of one of SEQ ID NOs:
141-241 and 378-513.
SEQUENCE LISTING



<110> Edwards, Jean-Baptiste Dumas Milne 
Duclert, Aymeric 
Bougueleret, Lydie 

<120> Extended cDNAs

<130> GENSET.019A 

<160> 519 

<170> Patent.pm



<210> 1 
<211> 47 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> In vitro transcription product
<221> modified_base
<222> (1)...(1)
<223> m7g 

<400> 1 

ngcauccuac ucccauccaa uuccacccua acuccuccca ucuccac 47 



<210> 2 
<211> 46 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> In vitro transcription product 
<400> 2 

gcauccuacu cccauccaau uccacccuaa cuccucccau cuccac 46



<210> 3 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> In vitro transcription product 



<400> 3 

atcaagaatt cgcacgagac catta 25



<210> 4 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 4 

taatggtctc gtgcgaattc ttgat 



<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide

<400> 5

ccgacaagac caacgtcaag gccgc



<210> 6
<211> 25
<212> DNA

<213> Artificial Sequence

<220>

<223> Oligonucleotide

<400> 6

tcaccagcag gcagtggctt aggag



<210> 7 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 7 

agtgattcct gctactttgg atggc 



<210> 8 
<211> 25 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Oligonucleotide 
<400> 8 

gcttggtctt gttctggagt ttaga 



<210> 9 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 9 

tccagaatgg gagacaagcc aattt 



<210> 10
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide

<400> 10 
r^9"99^999-g9" aaacagcgtg agtcc 



<210> 11
<211> 25
<212> DNA
<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 
<400> 11 

atgggaaagg aaaagactca tatca 



<210> 12 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 12

agcagcaaca atcaggacag cacag 25 



<210> 13 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 13 

atcaagaatt cgcacgagac catta 25 



<210> 14 

<211> 67 

<212> DNA 

<213> Artificial Sequence

<220>

<223> Oligonucleotide
<400> 14

atcgttgaga ctcgtaccag cagagtcacg agagagacta cacggtactg gttttttttt 60
tttttvn 67



<210> 15
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 15

ccagcagagt cacgagagag actacacgg 29



<210> 16 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



<400> 16 

cacgagagag actacacggt actgg 25



<210> 17

<211> 526 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> complement(261..376)

<223> blastn

<221> misc_feature

<222> complement(380..486)

<223> blastn

<221> misc_feature

<222> complement(110..145)

<223> blastn

<221> misc_feature

<222> complement(196..229)

<223> blastn

<221> sig_peptide

<222> 90..140

<223> Von Heijne matrix

<400> 17

aatatrarac agctacaata ttccagggcc artcacttgc catttctcat aacagcgtca 60
,j,^agagaaaga actgactgar acgtttgag atg aag aaa gtt ctc ctc ctg atc 113
Met Lys Lys Val Leu Leu Leu Ile

-15 -10
aca gcc atc ttg gca gtg gct gtw ggt ttc cca gtc tct caa gac cag 161
Thr Ala Ile Leu Ala Val Ala Val Gly Phe Pro Val Ser Gln Asp Gln

-5 1 5

gaa cga gaa aaa aga agt atc agt gac agc gat gaa tta gct tca ggr 209
Glu Arg Glu Lys Arg Ser Ile Ser Asp Ser Asp Glu Leu Ala Ser Gly
10 15 20

3'|tftt ttt gtg ttc cct tac cca tat cca ttt cgc cca ctt cca cca att 257
Xaa Phe Val Phe Pro Tyr Pro Tyr Pro Phe Arg Pro Leu Pro Pro Ile

25 30 35

cca ttt cca aga ttt cca tgg ttt aga cgt aan ttt cct att cca ata 305
Pro Phe Pro Arg Phe Pro Trp Phe Arg Arg Xaa Phe Pro Ile Pro Ile
40 45 50 55

cct gaa tct gcc cct aca act ccc ctt cct agc gaa aag taaacaaraa 354
Pro Glu Ser Ala Pro Thr Thr Pro Leu Pro Ser Glu Lys

60 65
ggaaaagtca crataaacct ggtcacctga aattgaaatt gagccacttc cttgaaraat 414
caaaattcct gttaataaaa raaaaacaaa tgtaattgaa atagcacaca gcattctcta 474
gtcaatatct ttagtgatct tctttaataa acatgaaagc aaaaaaaaaa aa 526



<210> 18 

<211> 17 

<212> PRT 

<213> Homo sapiens
<220>

<221> SIGNAL 
<222> 1. .17 

<223> Von Heijne matrix 
score 8.2 

seq LLLITAILAVAVG/FP 
<400> 18 

Met Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val
1 5 10 15

Gly 



<210> 19 

<211> 822 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 260..464
<223> blastn

<221> misc_feature
<222> 118..184
<223> blastn

<221> misc_feature
<222> 454..485
<223> blastn

<221> misc_feature
<222> 118..545
<223> blastn

<221> misc_feature
<222> 65..369
<223> blastn

<221> misc_feature
<222> 61..399
<223> blastn

<221> misc_feature
<222> 408..458
<223> blastn

<221> misc_feature
<222> 60..399
<223> blastn

<221> misc_feature
<222> 56..113
<223> blastn

<221> misc_feature
<222> 393..432
<223> blastn

<221> sig_peptide
<222> 346..408
<223> Von Heijne matrix



<400> 19 

actcctttta gcataggggc ttcggcgcca gcggccagcg ctagtcggtc tggtaagtgc 60 
ctgatgccga gttccgtctc tcgcgtcttt tcctggtccc aggcaaagcg gasgnagatc 120 
ctcaaacggc ctagtgcttc gcgcttccgg agaaaatcag cggtctaatt aattcctctg 180 
gtttgttgaa gcagttacca agaatcttca accctttccc acaaaagcta attgagtaca 240 
cgttcctgtt gagtacacgt tcctgttgat ttacaaaagg tgcaggtatg agcaggtctg 300 
aagactaaca ttttgtgaag ttgtaaaaca gaaaacctgt tagaa atg tgg tgg ttt 357 

Met Trp Trp Phe 
-20 

cag caa ggc ctc agt ttc ctt cct tca gcc ctt gta att tgg aca tct 405
Gln Gln Gly Leu Ser Phe Leu Pro Ser Ala Leu Val Ile Trp Thr Ser

-15 -10 -5

gct gct ttc ata ttt tca tac att act gca gta aca ctc cac cat ata 453
Ala Ala Phe Ile Phe Ser Tyr Ile Thr Ala Val Thr Leu His His Ile
1 5 10 15

gac ccg gct tta cct tat atc agt gac act ggt aca gta gct cca raa 501
Asp Pro Ala Leu Pro Tyr Ile Ser Asp Thr Gly Thr Val Ala Pro Xaa
20 25 30

aaa tgc tta ttt ggg gca atg cta aat att gcg gca gtt tta tgt caa 549
Lys Cys Leu Phe Gly Ala Met Leu Asn Ile Ala Ala Val Leu Cys Gln
35 40 45

jEiaa tagaaatcag gaarataatt caacttaaag aakttcattt catgaccaaa 602

ctcttcaraa acatgtcttt acaagcatat ctcttgtatt gctttctaca ctgttgaatt 662

gtctggcaat atttctgcag tggaaaattt gatttarmta gttcttgact gataaatatg 722

;^'^^taaggtggg cttttccccc tgtgtaattg gctactatgt cttactgagc caagttgtaw 782

tttgaaataa aatgatatga gagtgacaca aaaaaaaaaa 822



<210> 20

<211> 21
<212> PRT

<213> Homo sapiens
<220>

<221> SIGNAL
<222> 1..21

<223> Von Heijne matrix
score 5.5

seq SFLPSALVIWTSA/AF



<400> 20

Met Trp Trp Phe Gln Gln Gly Leu Ser Phe Leu Pro Ser Ala Leu Val
1 5 10 15

Ile Trp Thr Ser Ala
20



<210> 21

<211> 405 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<222> complement(103..398)

<223> blastn



<221> sig_peptide

<222> 185..295

<223> Von Heijne matrix



<400> 21 

atcaccttct tctccatcct tstctgggcc agtccccarc ccagtccctc tcctgacctg 60 
cccagcccaa gtcagccttc agcacgcgct tttctgcaca cagatattcc aggcctacct 120 
ggcattccag gacctccgma atgatgctcc agtcccttac aagcgcttcc tggatgaggg 180 
tggc atg gtg ctg acc acc ctc ccc ttg ccc tct gcc aac agc cct gtg 229
Met Val Leu Thr Thr Leu Pro Leu Pro Ser Ala Asn Ser Pro Val
-35 -30 -25

aac atg ccc acc act ggc ccc aac agc ctg agt tat gct agc tct gcc 277
Asn Met Pro Thr Thr Gly Pro Asn Ser Leu Ser Tyr Ala Ser Ser Ala
-20 -15 -10

J=^tg tcc ccc tgt ctg acc gct cca aak tcc ccc cgg ctt gct atg atg 325

Leu Ser Pro Cys Leu Thr Ala Pro Xaa Ser Pro Arg Leu Ala Met Met

-5 1 5 10

cct gac aac taaatatcct tatccaaatc aataaarwra raatcctccc 374

Pro Asp Asn

tccaraaggg tttctaaaaa caaaaaaaaa a 405



<210> 22

<211> 37

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> 1..37

<223> Von Heijne matrix
score 5.9

seq LSYASSALSPCLT/AP



<400> 22

Met Val Leu Thr Thr Leu Pro Leu Pro Ser Ala Asn Ser Pro Val Asn
1 5 10 15
Met Pro Thr Thr Gly Pro Asn Ser Leu Ser Tyr Ala Ser Ser Ala Leu
20 25 30
Ser Pro Cys Leu Thr
35



<210> 23 
<211> 496 
<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> 149..331
<223> blastn

<221> misc_feature
<222> 328..485
<223> blastn

<221> misc_feature

<222> complement(182..496)

<223> blastn

<221> sig_peptide

<222> 196..240

<223> Von Heijne matrix

<400> 23 

.«,,^aaaaattgg tcccagtttt caccctgccg cagggctggc tggggagggc agcggtttag 60 

'^;^ttagccgtg gcctaggccg tttaacgggg tgacacgagc ntgcagggcc gagtccaagg 12 0 

J|::ccggagata ggaccaaccg tcaggaatgc gaggaatgtt tttcttcgga ctctatcgag 180 

|;^i;^cacacagac agacc atg ggg att ctg tct aca gtg aca gcc tta aca ttt 231 
Met Gly Ile Leu Ser Thr Val Thr Ala Leu Thr Phe
-15 -10 -5

gcc ara gcc ctg gac ggc tgc aga aat ggc att gcc cac cct gca agt 279
Ala Xaa Ala Leu Asp Gly Cys Arg Asn Gly Ile Ala His Pro Ala Ser
1 5 10

gag aag cac aga ctc gag aaa tgt agg gaa ctc gag asc asc cac tcg 327
Glu Lys His Arg Leu Glu Lys Cys Arg Glu Leu Glu Xaa Xaa His Ser
15 20 25

gcc cca gga tca acc cas cac cga aga aaa aca acc aga aga aat tat 375
Ala Pro Gly Ser Thr Xaa His Arg Arg Lys Thr Thr Arg Arg Asn Tyr
30 35 40 45

tct tca gcc tgaaatgaak ccgggatcaa atggttgctg atcaragccc 424
Ser Ser Ala



atatttaaat tggaaaagtc aaattgasca ttattaaata aagcttgttt aatatgtctc 484 
aaacaaaaaa aa 496 



<210> 24 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .15 

<223> Von Heijne matrix 
score 5.5 

seq ILSTVTALTFAXA/LD 
<400> 24 

Met Gly Ile Leu Ser Thr Val Thr Ala Leu Thr Phe Ala Xaa Ala
1 5 10 15



<210> 25 

<211> 623 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> sig_peptide 

<222> 49. .96 

<223> Von Heijne matrix 

<400> 25 

aaagatccct gcagcccggc aggagagaag gctgagcctt ctggcgtc atg gag agg 57 

Met Glu Arg 
-15 



ctc gtc cta acc ctg tgc acc ctc ccg ctg gct gtg gcg tct gct ggc 105
Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala Ser Ala Gly
-10 -5 1

gcc acg acg cca gct cgc aac ctg agc tgc tac cag tgc ttc aag 153
Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tyr Gln Cys Phe Lys
5 10 15

agc agc tgg acg gag tgc ccg ccc acc tgg tgc agc ccg ctg gac 201
Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser Pro Leu Asp
20 25 30 35

caa gtc tgc atc tcc aac gag gtg gtc gtc tct ttt aaa tgg agt gta 249
Gln Val Cys Ile Ser Asn Glu Val Val Val Ser Phe Lys Trp Ser Val
40 45 50

gtc ctg ctc agc aaa cgc tgt gct ccc aga tgt ccc aac gac aac 297
Arg Val Leu Leu Ser Lys Arg Cys Ala Pro Arg Cys Pro Asn Asp Asn
55 60 65

^^tg aak ttc gaa tgg tcg ccg gcc ccc atg gtg caa ggc gtg atc acc 345
Xaa Phe Glu Trp Ser Pro Ala Pro Met Val Gln Gly Val Ile Thr
70 75 80

^ll^gg cgc tgc tgt tcc tgg gct ctc tgc aac agg gca ctg acc cca cag 393
Arg Cys Cys Ser Trp Ala Leu Cys Asn Arg Ala Leu Thr Pro Gln
85 90 95

gag ggg cgc tgg gcc ctg cra ggg ggg ctc ctg ctc cag gac cct tcg 441
Glu Gly Arg Trp Ala Leu Xaa Gly Gly Leu Leu Leu Gln Asp Pro Ser
100 105 110 115

agg ggc ara aaa acc tgg gtg cgg cca cag ctg ggg ctc cca ctc tgc 489
Arg Gly Xaa Lys Thr Trp Val Arg Pro Gln Leu Gly Leu Pro Leu Cys
120 125 130

ctt ccc awt tcc aac ccc ctc tgc cca rgg gaa acc cag gaa gga 534
Leu Pro Xaa Ser Asn Pro Leu Cys Pro Xaa Glu Thr Gln Glu Gly
135 140 145

taacactgtg ggtgccccca cctgtgcatt gggaccacra cttcaccctc ttggaracaa 594
taaactctca tgcccccaaa aaaaaaaaa 623



<210> 26 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<220>

<221> SIGNAL 
<222> 1. .16 

<223> Von Heijne matrix 
score 10.1 

seq LVLTLCTLPLAVA/SA 
<400> 26 

Met Glu Arg Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala 
1 5 10 15



<210> 27 

<211> 848 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> sig_peptide 
<222> 32..73
<223> Von Heijne matrix

<400> 27

"%actttgcct tgtgttttcc accctgaaag a atg ttg tgg ctg ctc ttt ttt 52
Met Leu Trp Leu Leu Phe Phe
-10

ptg gtg act gcc att cat gct gaa ctc tgt caa cca ggt gca gaa aat 100
Leu Val Thr Ala Ile His Ala Glu Leu Cys Gln Pro Gly Ala Glu Asn
-5 1 5

gct ttt aaa gtg aga ctt agt atc aga aca gct ctg gga gat aaa gca 148
Ala Phe Lys Val Arg Leu Ser Ile Arg Thr Ala Leu Gly Asp Lys Ala
10 15 20 25

tat gcc tgg gat acc aat gaa gaa tac ctc ttc aaa gcg atg gta gct 196
Tyr Ala Trp Asp Thr Asn Glu Glu Tyr Leu Phe Lys Ala Met Val Ala
30 35 40

ttc tcc atg aga aaa gtt ccc aac aga gaa gca aca gaa att tcc cat 244
Phe Ser Met Arg Lys Val Pro Asn Arg Glu Ala Thr Glu Ile Ser His
45 50 55

gtc cta ctt tgc aat gta acc cag agg gta tca ttc tgg ttt gtg gtt 292
Val Leu Leu Cys Asn Val Thr Gln Arg Val Ser Phe Trp Phe Val Val
60 65 70

aca gac cct tca aaa aat cac acc ctt cct gct gtt gag gtg caa tca 340
Thr Asp Pro Ser Lys Asn His Thr Leu Pro Ala Val Glu Val Gln Ser
75 80 85

gcc ata aga atg aac aag aac cgg atc aac aat gcc ttc ttt cta aat 388
Ala Ile Arg Met Asn Lys Asn Arg Ile Asn Asn Ala Phe Phe Leu Asn
90 95 100 105

gac caa act ctg gaa ttt tta aaa atc cct tcc aca ctt gca cca ccc 436
Asp Gln Thr Leu Glu Phe Leu Lys Ile Pro Ser Thr Leu Ala Pro Pro
110 115 120

atg gac cca tct gtg ccc atc tgg att att ata ttt ggt gtg ata ttt 484
Met Asp Pro Ser Val Pro Ile Trp Ile Ile Ile Phe Gly Val Ile Phe
125 130 135

tgc atc atc ata gtt gca att gca cta ctg att tta tca ggg atc tgg 532
Cys Ile Ile Ile Val Ala Ile Ala Leu Leu Ile Leu Ser Gly Ile Trp
140 145 150

Cd.3. cgt ada ara aag aac aaa gaa cca tCu gaa gtg gat gac gct gaa 580
Gln Arg Xaa Xaa Lys Asn Lys Glu Pro Ser Glu Val Asp Asp Ala Glu
155 160 165

rac aak tgt gaa aac atg atc aca att gaa aat ggc atc ccc tct gat 628
Xaa Xaa Cys Glu Asn Met Ile Thr Ile Glu Asn Gly Ile Pro Ser Asp
170 175 180 185

ccc ctg gac atg aag gga ggg cat att aat gat gcc ttc atg aca gag 676
Pro Leu Asp Met Lys Gly Gly His Ile Asn Asp Ala Phe Met Thr Glu
190 195 200

gat gag agg ctc acc cct ctc tgaagggctg ttgttctgct tcctcaaraa 727
Asp Glu Arg Leu Thr Pro Leu
205

attaaacatt tgtttctgtg tgactgctga gcatcctgaa ataccaagag cagatcatat 787
wttttgtttc accattcttc ttttgtaata aattttgaat gtgcttgaaa aaaaaaaaaa 847
c 848



<210> 28 

<211> 14 
<212> PRT
<213> Homo sapiens
<220>

<221> SIGNAL
<222> 1..14

<223> Von Heijne matrix

score 10.7

seq LWLLFFLVTAIHA/EL

<400> 28

Met Leu Trp Leu Leu Phe Phe Leu Val Thr Ala Ile His Ala
1 5 10



<210> 29

<211> 25

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 29 

gggaagatgg agatagtatt gcctg 25



<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide



<400> 30 

ctgccatgta catgatagag agattc 



<210> 31 

<211> 546 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> promoter
<222> 1..517

<221> transcription start site
<222> 518

<221> protein_bind
<222> 17..25
<223> matinspector prediction
name CMYB_01
score 0.983
sequence tgtcagttg

<221> protein_bind
<222> complement(18..27)
<223> matinspector prediction
name MYOD_Q6
score 0.961
sequence cccaactgac

<221> protein_bind
<222> complement(75..85)
<223> matinspector prediction
name S8_01
score 0.960
sequence aatagaattag

<221> protein_bind
<222> 94..104
<223> matinspector prediction
name S8_01
score 0.966
sequence aactaaattag

<221> protein_bind
<222> complement(129..139)
<223> matinspector prediction
name DELTAEF1_01
score 0.960
sequence gcacacctcag

<221> protein_bind
<222> complement(155..165)
<223> matinspector prediction
name GATA_C
score 0.964
sequence agataaatcca

<221> protein_bind
<222> 170..178
<223> matinspector prediction
name CMYB_01
score 0.958
sequence cttcagttg

<221> protein_bind
<222> 176..189
<223> matinspector prediction
name GATA1_02
score 0.959
sequence ttgtagataggaca

<221> protein_bind
<222> 180..190
<223> matinspector prediction
name GATA_C
score 0.953
sequence agataggacat

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1ALPHAE47_01
score 0.973
sequence cataacagatggtaag

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1BETAE47_01
score 0.983
sequence cataacagatggtaag

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1BETAITF2_01
score 0.978
sequence cataacagatggtaag

<221> protein_bind
<222> complement(287..296)
<223> matinspector prediction
name MYOD_Q6
score 0.954
sequence accatctgtt

<221> protein_bind
<222> complement(302..314)
<223> matinspector prediction
name GATA1_04
score 0.953
sequence tcaagataaagta

<221> protein_bind
<222> 393..405
<223> matinspector prediction
name IK1_01
score 0.963
sequence agttgggaattcc

<221> protein_bind
<222> 393..404
<223> matinspector prediction
name IK2_01
score 0.985
sequence agttgggaattc

<221> protein_bind
<222> 396..405
<223> matinspector prediction
name CREL_01
score 0.962
sequence tgggaattcc

<221> protein_bind
<222> 423..436
<223> matinspector prediction
name GATA1_02
score 0.950
sequence tcagtgatatggca

<221> protein_bind
<222> complement(478..489)
<223> matinspector prediction
name SRY_02
score 0.951
sequence taaaacaaaaca

<221> protein_bind
<222> 486..493
<223> matinspector prediction
name E2F_02
score 0.957
sequence tttagcgc

<221> protein_bind
<222> complement(514..521)
<223> matinspector prediction
name MZF1_01
score 0.975
sequence tgagggga



<400> 31 

tgagtgcagt gttacatgtc agttgggtta agtttgttaa tgtcattcaa atcttctatg 60 

tcttgatttg cctgctaatt ctattatttc tggaactaaa ttagtttgat ggttctatta 120 

gttattgact gaggtgtgct aatctcccat tatgtggatt tatctatttc ttcagttgta 180 

gataggacat tgatagatac ataagtacca ggacaaaagc agggagatct tttttccaaa 240 

atcaggagaa aaaaatgaca tctggaaaac ctatagggaa aggcataaca gatggtaagg 300 
atactttatc ttgagtagga gagccttcct gtggcaacgt ggagaaggga agaggtcgta 360

gaattgagga gtcagctcag ttagaagcag ggagttggga attccgttca tgtgatttag 420 

catcagtgat atggcaaatg tgggactaag ggtagtgatc agagggttaa aattgtgtgt 480 

tttgttttag cgctgctggg gcatcgcctt gggtcccctc aaacagattc ccatgaatct 540 

cttcat 546 



<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 32 

gtaccaggga ctgtgaccat tgc 23 



<210> 33
<211> 24
<212> DNA

<213> Artificial Sequence

<220>

<223> Oligonucleotide

<400> 33

ctgtgaccat tgctcccaag agag 24



<210> 34

<211> 861

<212> DNA

<213> Homo sapiens

<220>

<221> promoter
<222> 1..806

<221> transcription start site
<222> 807

<221> protein_bind
<222> complement(60..70)
<223> matinspector prediction
name NFY_Q6
score 0.956
sequence ggaccaatcat

<221> protein_bind
<222> 70..77
<223> matinspector prediction
name MZF1_01
score 0.962
sequence cctgggga

<221> protein_bind
<222> 124..132
<223> matinspector prediction
name CMYB_01
score 0.994
sequence tgaccgttg

<221> protein_bind
<222> complement(126..134)
<223> matinspector prediction
name VMYB_02
score 0.985
sequence tccaacggt

<221> protein_bind
<222> 135..143
<223> matinspector prediction
name STAT_01
score 0.968
sequence ttcctggaa

<221> protein_bind
<222> complement(135..143)
<223> matinspector prediction
name STAT_01
score 0.951
sequence ttccaggaa

<221> protein_bind
<222> complement(252..259)
<223> matinspector prediction
name MZF1_01
score 0.956
sequence ttggggga

<221> protein_bind
<222> 357..368
<223> matinspector prediction
name IK2_01
score 0.965
sequence gaatgggatttc

<221> protein_bind
<222> 384..391
<223> matinspector prediction
name MZF1_01
score 0.986
sequence agagggga

<221> protein_bind
<222> complement(410..421)
<223> matinspector prediction
name SRY_02
score 0.955
sequence gaaaacaaaaca

<221> protein_bind
<222> 592..599
<223> matinspector prediction
name MZF1_01
score 0.960
sequence gaagggga

<221> protein_bind
<222> 618..627
<223> matinspector prediction
name MYOD_Q6
score 0.981
sequence agcatctgcc

<221> protein_bind
<222> 632..642
<223> matinspector prediction
name DELTAEF1_01
score 0.958
sequence tcccaccttcc

<221> protein_bind
<222> complement(813..823)
<223> matinspector prediction
name S8_01
score 0.992
sequence gaggcaattat

<221> protein_bind
<222> complement(824..831)
<223> matinspector prediction
name MZF1_01
score 0.986
sequence agagggga

<400> 34

tactataggg cacgcgtggt cgacggccgg gctgttctgg agcagagggc atgtcagtaa 60 

tgattggtcc ctggggaagg tctggctggc tccagcacag tgaggcattt aggtatctct 120 

cggtgaccgt tggattcctg gaagcagtag ctgttctgtt tggatctggt agggacaggg 180 

ctcagagggc taggcacgag ggaaggtcag aggagaaggs aggsarggcc cagtgagarg 240 

ggagcatgcc ttcccccaac cctggcttsc ycttggymam agggcgktty tgggmacttr 300 

aaytcagggc ccaascagaa scacaggccc aktcntggct smaagcacaa tagcctgaat 360 

gggatttcag gttagncagg gtgagagggg aggctctctg gcttagtttt gttttgtttt 420 

ccaaatcaag gtaacttgct cccttctgct acgggccttg gtcttggctt gtcctcaccc 480 

agtcggaact ccctaccact ttcaggagag tggttttagg cccgtggggc tgttctgttc 540 

caagcagtgt gagaacatgg ctggtagagg ctctagctgt gtgcggggcc tgaaggggag 600

tgggttctcg cccaaagagc atctgcccat ttcccacctt cccttctccc accagaagct 660 

tgcctgagct gtttggacaa aaatccaaac cccacttggc tactctggcc tggcttcagc 720

ttggaaccca atacctaggc ttacaggcca tcctgagcca ggggcctctg gaaattctct 780 

tcctgatggt cctttaggtt tgggcacaaa atataattgc ctctcccctc tcccattttc 840 

tctcttggga gcaatggtca c 861 



<210> 35 



<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 35 

ctgggatgga aggcacggta 



<210> 36 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 36 
gagaccacac agctagacaa



<210> 37

<211> 555

<212> DNA

<213> Homo sapiens

<220>

<221> promoter
<222> 1..500

<221> transcription start site
<222> 501

<221> protein_bind
<222> 191..206
<223> matinspector prediction
name ARNT_01
score 0.964
sequence ggactcacgtgctgct

<221> protein_bind
<222> 193..204
<223> matinspector prediction
name NMYC_01
score 0.965
sequence actcacgtgctg

<221> protein_bind
<222> 193..204
<223> matinspector prediction
name USF_01
score 0.985
sequence actcacgtgctg

<221> protein_bind
<222> complement(193..204)
<223> matinspector prediction
name USF_01
score 0.985
sequence cagcacgtgagt

<221> protein_bind
<222> complement(193..204)
<223> matinspector prediction
name NMYC_01
score 0.956
sequence cagcacgtgagt

<221> protein_bind
<222> complement(193..204)
<223> matinspector prediction
name MYCMAX_02
score 0.972
sequence cagcacgtgagt

<221> protein_bind
<222> 195..202
<223> matinspector prediction
name USF_C
score 0.997
sequence tcacgtgc

<221> protein_bind
<222> complement(195..202)
<223> matinspector prediction
name USF_C
score 0.991
sequence gcacgtga

<221> protein_bind
<222> complement(210..217)
<223> matinspector prediction
name MZF1_01
score 0.968
sequence catgggga

<221> protein_bind
<222> 397..410
<223> matinspector prediction
name ELK1_02
score 0.963
sequence ctctccggaagcct

<221> protein_bind
<222> 400..409
<223> matinspector prediction
name CETS1P54_01
score 0.974
sequence tccggaagcc

<221> protein_bind
<222> complement(460..470)
<223> matinspector prediction
name AP1_Q4
score 0.963
sequence agtgactgaac

<221> protein_bind
<222> complement(460..470)
<223> matinspector prediction
name AP1FJ_Q2
score 0.961
sequence agtgactgaac

<221> protein_bind
<222> 547..555
<223> matinspector prediction
name PADS_C
score 1.000
sequence tgtggtctc

<400> 37

i^-tJtatagggca cgcktggtcg acggcccggg ctggtctggt ctgtkgtgga gtcgggttga 60 

;;l4ggacagcat ttgtkacatc tggtctactg caccttccct ctgccgtgca cttggccttt 120 

M^awaagctca gcaccggtgc ccatcacagg gccggcagca cacacatccc attactcaga 180 

^^ggaactgac ggactcacgt gctgctccgt ccccatgagc tcagtggacc tgtctatgta 240 

Ij'^agcagtcag acagtgcctg ggatagagtg agagttcagc cagtaaatcc aagtgattgt 300 

J?^ttcctgtc tgcattagta actcccaacc tagatgtgaa aacttagttc tttctcatag 360 

Ij^ttgctctgc ccatggtccc actgcagacc caggcactct ccggaagcct ggaaatcacc 420 

yfgtgtcttct gcctgctccc gctcacatcc cacacttgtg ttcagtcact gagttacaga 480 

■'fctttgcctcc tcaatttctc ttgtcttagt cccatcctct gttcccctgg ccagtttgtc 540 

"tagctgtgtg gtctc 555 



<210> 38
<211> 19
<212> DNA

<213> Artificial Sequence
<220> 

<223> Oligonucleotide 
<400> 38 

ggccatacac ttgagtgac 19 



<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



<400> 39 
atatagacaa acgcacacc 19



<210> 40 

<211> 568 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 7. .471 

<221> sig_peptide 
<222> 7. .99 

<223> Von Heijne matrix 
score 6.9

seq LLLVPSALSLLLA/LL

<221> polyA_signal
<222> 537..542

<221> polyA_site
<222> 554..568

<400> 40

f&ggacc atg ttc acc agc acc ggc tcc agt ggg ctc tac aag gcg cct 48
Met Phe Thr Ser Thr Gly Ser Ser Gly Leu Tyr Lys Ala Pro
-30 -25 -20

ptg tcg aag agc ctt ctg ctg gtc ccc agt gcc ctc tcc ctc ctg ctc 96
Leu Ser Lys Ser Leu Leu Leu Val Pro Ser Ala Leu Ser Leu Leu Leu
-10 -5

gcc ctc ctc ctg cct cac tgc cag aag ccc ttt gtg tat gac ctt cac 144
Ala Leu Leu Leu Pro His Cys Gln Lys Pro Phe Val Tyr Asp Leu His
1 5 10 15

gca gtc aag aac gac ttc cag att tgg agg ttg ata tgt gga aga ata 192
Ala Val Lys Asn Asp Phe Gln Ile Trp Arg Leu Ile Cys Gly Arg Ile
20 25 30

att tgc ctt gat ttg aaa gat act ttc tgc agt agt ctg ctt att tat 240
Ile Cys Leu Asp Leu Lys Asp Thr Phe Cys Ser Ser Leu Leu Ile Tyr
35 40 45

aat ttt agg ata ttt gaa aga aga tat gga agc aga aaa ttt gca tcc 288
Asn Phe Arg Ile Phe Glu Arg Arg Tyr Gly Ser Arg Lys Phe Ala Ser
50 55 60

ttt ttg ctg ggt acc tgg gtt ttg tca gcc tta ttt gac ttt ctc ctc 336
Phe Leu Leu Gly Thr Trp Val Leu Ser Ala Leu Phe Asp Phe Leu Leu
65 70 75

att gaa gct atg cag tat ttc ttt ggc atc act gca gct agt aat ttg 384
Ile Glu Ala Met Gln Tyr Phe Phe Gly Ile Thr Ala Ala Ser Asn Leu
80 85 90 95

cct tct gga tta atc ttt tgt tgt gct ttt tgc tct gag act aaa ctc 432
Pro Ser Gly Leu Ile Phe Cys Cys Ala Phe Cys Ser Glu Thr Lys Leu
100 105 110

ttc tta tca aga caa gct atg gca gag aac ttt tcc atc taataaattt 481
Phe Leu Ser Arg Gln Ala Met Ala Glu Asn Phe Ser Ile
115 120

aagagtagat tcatctgtat ggttgagagt aggctctgac tatgtatatg tgtataataa 541
acctacatat ccaaaaaaaa aaaaaaa 568



<210> 41 

<211> 569 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 168. .332 



<221> polyA_signal 
<222> 557. .562 



<400> 41 

^gggggcgtg gggccatggt ggtcttgcgg gcggggaaga agacctttct cccccctctc 60 
tgccgcgcct tcgcctgccg cggctgtcaa ctcgctccgg agcgcggcgc cgagcgcagg 120 
gatacggcgc ccagcggggt cagaaagcaa cattgaatgc agaagaa atg gcg gac 176 

Met Ala Asp 

ttc tac aag gaa ttt tta agt aaa aat ttt cag aag cgc atg tat tat 224
Phe Tyr Lys Glu Phe Leu Ser Lys Asn Phe Gln Lys Arg Met Tyr Tyr
5 10 15

aac aga gat tgg tac aag cgc aat ttt gcc atc acc ttc ttc atg gga 272
Asn Arg Asp Trp Tyr Lys Arg Asn Phe Ala Ile Thr Phe Phe Met Gly
20 25 30 35

gtg gcc ctg gaa agg att tgg aac aag ctt aaa cag aaa caa aag 320
Lys Val Ala Leu Glu Arg Ile Trp Asn Lys Leu Lys Gln Lys Gln Lys
40 45 50

aag agg agc aac taggagtcca ctctgaccca gccagagtcc aggtttccac 372
Lys Arg Ser Asn
55

|i='^feggaagcaga tggagctcct ttcacagggg ctctgagaaa aactggagcc gatctcaaga 432 
Skgccccacat cttcctaagg ggccccatgg cctgtttggg ggcagggtag gtcctggggc 492 
actgtgggcc gcctgcctgc tgatgtgggc tctaggccag cttgttgtca cgtacgtggt 552
gtgaaataaa gcccaag 569



<210> 42 
<211> 895 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 51. .251 

<221> sig_peptide
<222> 51..110
<223> Von Heijne matrix
score 5.3

seq ALIFGGFISLIGA/AF

<221> polyA_signal
<222> 849..854

<221> polyA_site
<222> 882..895



<400> 42 

ccgagagtgc cgggcggtcg gcgggtcagg gcagcccggg gcctgacgcc atg tcc 56
Met Ser
-20

cgg aac ctg cgc acc gcg ctc att ttc ggc ggc ttc atc tcc ctg atc 104
Arg Asn Leu Arg Thr Ala Leu Ile Phe Gly Gly Phe Ile Ser Leu Ile
-15 -10 -5

ggc gcc gcc ttc tat ccc atc tac ttc cgg ccc cta atg aga ttg gag 152
Gly Ala Ala Phe Tyr Pro Ile Tyr Phe Arg Pro Leu Met Arg Leu Glu
1 5 10

gag tac aag aag gaa caa gct ata aat cgg gct gga att gtt caa gag 200
Glu Tyr Lys Lys Glu Gln Ala Ile Asn Arg Ala Gly Ile Val Gln Glu
15 20 25 30

gat gtg cag cca cca ggg tta aaa gtg tgg tct gat cca ttt ggc agg 248
Asp Val Gln Pro Pro Gly Leu Lys Val Trp Ser Asp Pro Phe Gly Arg
35 40 45

aaa tgagagggct gtcatcagct ctgattaaga aaggagattt cttcatgctt 301

tcgattctgc atggggtaca gccagtcacc tcaccagaga atgacggctg gagaagaaaa 361

ctctgtaata ccataaataa gagtgcttgt aataaaagac tgtgcacaag gattaatatt 421

tcccttctta agtatcaaaa gaactctgga acaaattata ccattaggaa ggttttcatg 481

y>ttcagttga ttttccaaaa atgaagctat ctcacccagc tgggtttgga ggagcaatct 541

j^cttattatt ctgtcgttac cacttactca agcgagctgt gatatgaata caagcaacca 601

iyptgggctcgg gaaggtccgg gtctcttctg ccatcttcca gataagagat ttcagtaaaa 661

IjSpactgccatg ctgagctgcc ttatagagct cttcgaaaat gttcgagttg ataaagctct 721

ttgaggacaa ggtacttcgt gcacctcatg ctgaagattg caccatgttg gaagataaat 781

atgaagcaag tcaaactaga tgcatacact tgtgtagaaa tcaataatca attaatagaa 841

Jl'^tgaaaaaat agacattaag atgatttatt tccactttgc aaaaaaaaaa aaaa 895



<210> 43
<211> 691
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 20..613

<221> sig_peptide
<222> 20..82
<223> Von Heijne matrix
score 10

seq LWALAMVTRPASA/AP
<400> 43 

ataccttaga ccctcagtc atg cca gtg cct gct ctg tgc ctg ctc tgg gcc 52
                     Met Pro Val Pro Ala Leu Cys Leu Leu Trp Ala
                         -20                 -15

ctg gca atg gtg acc cgg cct gcc tca gcg gcc ccc atg ggc ggc cca 100
Leu Ala Met Val Thr Arg Pro Ala Ser Ala Ala Pro Met Gly Gly Pro
-10                 -5                  1               5

gaa ctg gca cag cat gag gag ctg acc ctg ctc ttc cat ggg acc ctg 148
Glu Leu Ala Gln His Glu Glu Leu Thr Leu Leu Phe His Gly Thr Leu
            10                  15                  20

cag ctg ggc cag gcc ctc aac ggt gtg tac agg acc acg gag gga tgg 196
Gln Leu Gly Gln Ala Leu Asn Gly Val Tyr Arg Thr Thr Glu Gly Trp
        25                  30                  35

ctg aca aag gcc agg aac agc ctg ggt ctc tat ggc cgc aca ata gaa 244
Leu Thr Lys Ala Arg Asn Ser Leu Gly Leu Tyr Gly Arg Thr Ile Glu
    40                  45                  50

ctc ctg ggg cag gag gtc agc cgg ggc cgg gat gca gcc cag gaa ctt 292
Leu Leu Gly Gln Glu Val Ser Arg Gly Arg Asp Ala Ala Gln Glu Leu
55                  60                  65                  70

cgg gca agc ctg ttg gag act cag atg gag gag gat att ctg cag ctg 340
Arg Ala Ser Leu Leu Glu Thr Gln Met Glu Glu Asp Ile Leu Gln Leu
                75                  80                  85

cag gca gag gcc aca gct gag gtg ctg ggg gag gtg gcc cag gca cag 388
Gln Ala Glu Ala Thr Ala Glu Val Leu Gly Glu Val Ala Gln Ala Gln
            90                  95                  100

    gtg cta cgg gac agc gtg cag cgg cta gaa gtc cag ctg agg agc 436
Lys Val Leu Arg Asp Ser Val Gln Arg Leu Glu Val Gln Leu Arg Ser
        105                 110                 115

gcc tgg ctg ggc cct gcc tac cga gaa ttt gag gtc tta aag gct cac 484
Ala Trp Leu Gly Pro Ala Tyr Arg Glu Phe Glu Val Leu Lys Ala His
    120                 125                 130

gct gac aag cag agc cac atc cta tgg gcc ctc aca ggc cac gtg cag 532
Ala Asp Lys Gln Ser His Ile Leu Trp Ala Leu Thr Gly His Val Gln
135                 140                 145                 150

pgg cag agg egg gag atg gtg gca cag cag cat egg ctg ega cag ate 580 

Arg Gln Arg Arg Glu Met Val Ala Gln Gln His Arg Leu Arg Gln Ile
                155                 160                 165

cag gag aga ctc cac aca gcg gcg ctc cca gcc tgaatctgcc tggatggaac 633
Gln Glu Arg Leu His Thr Ala Ala Leu Pro Ala
            170                 175

^gaggaccaa tcatgctgca aggaacactt ecacgceecg tgaggcccct gtgeaggg 691 



<210> 44
<211> 458
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 12..416

<221> sig_peptide
<222> 12..86
<223> Von Heijne matrix
score 4
seq LWMVPLVGLIHL/GW

<221> polyA_signal
<222> 425..430

<221> polyA_site
<222> 445..458

<400> 44

gctgaagtac t atg agc ctt cgg aac ttg tgg aga gac tac aaa gtt ttg 50
             Met Ser Leu Arg Asn Leu Trp Arg Asp Tyr Lys Val Leu
             -25                 -20                 -15







;5 tcT 
duy 




r-~l /-I -J- 




gzz 


ggg 


etc 


ata 


cat 


cug 


ggg 


tgg 


tac 


aga 


98 


Val 


Val 






ir X. 




veil. 


Gly Leu 


He 


IT-! c* 

HIS 


Leu 


Cjiy 


Trp 


Tyr 


Arg 
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-3 a O 


age 


age 


cct 


yuu 


UuC 


caa 


ata 


cct 


aaa 


aac 


gac 


gac 


att 


cct 


146 


JL 


T .^70 

y *^ 


C V 






V a.X 


Fne 


Gin 


He 


Pro 


Lys 


Asn 


Asp 


Asp 


He 


Pro 




D 




















15 










20 




y^g 


caa 


rra t- 
y a. U 


agt 


ctg 




CUU 


tea 


aat 


ctt 


cag 


aag 


age 


caa 


ate 


cag 


194 


oXU 




Asp 


Ser 


Leu 

2 b 


Cjiy 


Leu 


Ser 


Asn 


Leu 
30 


Gin 


Lys 


Ser 


Gin 


He 
35 


Gin 






aag 


nta 


gca 




ttg 


caa 


tet 


tea 


ggt 


aaa 


gaa 


gca 


get 


ttg 


aat 


242 


Gly 


Lys 


Xaa 


Ala 


Gly 


Leu 


Gin 


Ser 


Ser Gly 


Lys 


Glu 


Ala 


Ala 


Leu 


Asn 










40 










45 










50 








ctg 


age 


ttc 


ata 


teg 


aaa 


gaa 


gag 


atg 


aaa 


aat 


ace 


agt 


tgg 


att 


aga 


290 


Leu 


Ser 


Phe 


He 


Ser 


Lys 


Glu 


Glu 


Met 


Lys 


Asn 


Thr 


Ser 


Trp 


He 


Arg 








55 










60 










65 








.^..^aag 


aac 


tgg 


ctt 


ctt 


gta 


get 


ggg 


ata 


tet 


ttc 


ata 


ggt 


gac 


eat 


ctt 


338 




Asn 


Trp 


Leu 


Leu 


Val 


Ala 


Gly He 


Ser 


Phe 


He 


Gly 


Asp 


His 


Leu 






70 










75 










80 












I ^^ga 


aca 


tac 


ttt 


ttg 


cag 


agg 


tet 


gca 


aag 


cag 


tet 


gta 


aaa 


ttt 


cag 


386 


l^^^Gly 


Thr 


Tyr 


Phe 


Leu 


Gin 


Arg 


Ser 


Ala 


Lys 


Gin 


Ser 


Val 


Lys 


Phe 


Gin 




iri85 










90 










95 










100 






caa 


age 


aaa 


caa 


aag 


agt 


att 


gaa 


gag 


tgaagtaaaa taaatatttg 


436 


[jper 


Gin 


Ser 


Lys 


Gin 


Lys 


Ser 


He 


Glu 


Glu 
























105 










110 
















■* gaattactaa aaaaaaaaaa aa 




















458 



<210> 45
<211> 2036
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 276..1040

<221> sig_peptide
<222> 276..485
<223> Von Heijne matrix
score 3.9
seq SVIGVMLAPFTAG/LS

<221> polyA_site
<222> 2024..2036

<400> 45

gatcctgggt gcagctcatc acaagcgtcg gggtgcagca aaaccatcca ggctggacag 60

tggctggaca gttccaagaa aagaaacgct tcaccgaaga agtcattgaa tacttccaga 120

agaaagttag cccagtgcat ctgaaaatcc tgctgactag cgatgaagcc tggaagagat 180

tcgtgcgtgt ggctggattg cccagggaag aagcagatgc tctctatgaa gctctgaaga 240

atcttacacc atatgtggct attgaggaca aagac atg cag caa aaa gaa cag 293
                                       Met Gln Gln Lys Glu Gln
                                       -70                 -65



cag 




^gg 


g^g 


tgg 


1- -t- f- 


u ug 


aaa 


gag 


4- 4_ 4- 


ect 


caa 


ate 


aga 


tgg 


aag 


341 


Gin 


Phe 


Arg 


CjIU 


Trp 
-0 u 


Pne 


Leu 


Lys 


Glu 


Phe 
-55 


Pro 


Gin 


He 


Arg 


Trp 
-50 


Lys 




att 


cag 


g^g 


tec 


aca 


gaa 


agg 


ctt 


cgt 


gtc 


att 


gca 


aat 


gag 


att 


gaa 


389 


He 


Gin 


oJ-U 


Ser 


i±e 


CjIU 


Arg 


Leu 


Arg 
-40 


Val 


lie 


Ala 


Asn 


Glu 
-35 


He 


Glu 




aag 


gtc 


cac 


aga 


ggc 


tgc 


gtc 


ate 


gee 


aat 


gtg 


gtg 


tet 


ggc 


tee 


act 


437 


Lys 


Val 


rllS 


Arg 


Cjiy 


Cys 


Val 


He 


Ala 


Asn 


Val 


Val 


Ser Gly 


Ser 


Thr 


















-25 
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Ctg 


4— J— 4. 

uct 


gtc 


att 


ggc 


gtt 


atg 


ttg 


gca 


cea 


ttt 
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gca 


ggg 


485 


Gly He 


Leu 


Ser 


Val 


He 


Gly 


Val 


Met 


Leu 


Ala 


Pro 


Phe 


Thr 


Ala Gly 






-15 










-10 










-5 












ctg 
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ctg 


age 


att 


act 


gca 


get 


ggg 


gta 


ggg 


ctg 


gga 


ata 


gca 


tet 


533 


Leu 


Ser 


Leu 


Ser 


He 


Thr 


Ala 


Ala Gly 


Val 


Gly 


Leu Gly He 


Ala 


Ser 




1 








5 










10 










15 






gcc 


a eg 


get 


ggg 


ate 


gee 


tec 


age 


ate 


gtg 


gag 


aac 


aca 


tac 


aca 


agg 


581 


Ala 


Thr 


Ala 


Gly 


He 


Ala 


Ser 


Ser 


He 


Val 


Glu 


Asn 


Thr 


Tyr 


Thr Arg 










20 










25 










30 








;i.^^ca 


gca 


gaa 


etc 


aca 


gcc 


age 


agg 


ctg 


act 


gca 


ace 


age 


act 


gac 


caa 


629 


:;^er Ala 


Glu 


Leu 


Thr 


Ala 


Ser 


Arg 


Leu 


Thr 


Ala 


Thr 


Ser 


Thr 


Asp 


Gin 








35 










40 










45 








'^^^tg gag 


gca 


tta 


agg 


gac 


att 


ctg 


cat 


gac 


ate 


aca 


cec 


aat 


gtg 


ctt 


677 


'^^Leu 


Glu 
50 


Ala 


Leu 


Arg 


Asp 


He 
55 


Leu 


His 


Asp 


He 


Thr 
60 


Pro 


Asn 


Val 


Leu 
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gca 
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ttt 


gac 


gaa 


gcc 


aca 


aaa 


atg 


att 


gcg 


aat 


gat 


725 


,! Ber 


Phe 


Ala 


Leu 


Asp 


Phe 


Asp 


Glu 


Ala 


Thr 


Lys 


Met 


He 


Ala 


Asn 


Asp 




if 5 
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He 
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^'^la 
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He 
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Glu 
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He 
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Lys 
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Asp Val 


Val 
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ggg 
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ttg 
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Asp 


Leu 
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Lys 


Gly Glu 


Lys 


Ser 


Glu 


Ser 


Ala 


Glu 


Leu 
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get 


cag 
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ctg 
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gag 
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aat 
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etc 
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1013 
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Gin 
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Glu 


Glu 


Asn 


Leu 
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Glu 


Leu 


Thr 
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ate 
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gca 
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taggeeeaat tgttgeggga 




1060 


His 


He 


His 


Gin 
180 


Ser 


Leu 


Lys 


Ala 


Gly 
185 



















agtcagggac cccaaacgga gggactggct gaagccatgg cagaagaacg tggattgtga 1120

agatttcatg gacatttatt agttccccaa attaatactt ttataatttc ctatgcctgt 1180

ctttaccgca atctctaaac acaaattgtg aagatttcat ggacacttat cacttcccca 1240

atcaataccc ttgtgatttc ttatgcctgt ctttacttta atctcctaat cctgtcagct 1300

gaggagggtg tatgtcacct caggaccatg tgataattgc gttaactgca caaattgtag 1360

agcatgtgtg tttgaacaat atgaaatctg ggcaccttga aaaaagaaca ggataacagc 1420

aatcgttcag gggataagag agataacctt aaactctgac caacagtgag ccgggtggag 1480 

cagagtcata tttcttttct ttcaaaagca aatgggagaa atatcgctga attctttttc 1540 

tcagcaagga acatccctga gaaagagaat gcacccctga gggtgggtct ataaatggcc 1600 

tccttgggtg tggccatctt ctatggtcga gactgtaggg atgaaataaa ccccagtctc 1660 

ccatagtgct cccaggctta ttaggaagag gaaattcccg cctaataaat tttggtcaga 1720 

ccggttgctc tcaaaaccct gtctcctgat aagatgttat caatgacaat ggtgcctgaa 1780 

acctcattag caattttaat ttctccccgg tcctgtggtc ctgtgatctc accctgcctc 1840 

cacttgcctt gtgatattct attaccttgt gaagtaggtg atctttgtga cccacaccct 1900 

attcatacac tccctcccct tttggaagtc cctaataaaa acttgctggt tttgcagctt 1960 

gtgaggcatc acggaaccta ctgatgtgtg atgtctcccc tggacaccta gctttaaaat 2020 

ttcaaaaaaa aaaaaa 2036 



<210> 46
<211> 1276
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 443..619

<221> sig_peptide
<222> 443..589
<223> Von Heijne matrix
score 7
seq LICWCLYIVCRC/GS

<221> polyA_site
<222> 1267..1276

<400> 46

J-%aggcactca cggcatttca ttgctacttt aattttcatt attatgggat tgattgctgt 60 
j^^%:acagctact gctgcagtag ctggagttgc tttgcattcc acagtacaaa cagcagacta 120 
'^^tgtaaataat tggtagaaaa attctactct gctgtggaat taccaagata atatagacca 180 
yj^aaactagct gatcaaatta atgatctcca acaaactgta atgtggctag gggatcatat 240 
rij^gttagttta gaatatagaa tgcggttaca atgtgattga aatacctctg atttttgcat 300 
tactcctcat ctgtgtaatg aaacagagca tgagtgggaa aaagttaaga gatatttaaa 360 
aggtcatact agaaatttat ctttggatat tgcaaagcta aaggaacaag tatttcaagc 420 
ccctcagata catctgacac ta atg cca gga act gaa gtg ctt gaa gga gct 472
                         Met Pro Gly Thr Glu Val Leu Glu Gly Ala
                                         -45                 -40

aca gac gga tta gca gct att aac ctg cta aaa tgg atc aag aca ctt 520
Thr Asp Gly Leu Ala Ala Ile Asn Leu Leu Lys Trp Ile Lys Thr Leu
                -35                 -30                 -25

gga ggc tct gtg att tca atg att gtg ctt tta atc tgt gtt gtt tgt 568
Gly Gly Ser Val Ile Ser Met Ile Val Leu Leu Ile Cys Val Val Cys
            -20                 -15                 -10

ctt tat ata gtc tgt aga tgc gga agc cac ctc tgg aga gaa agc cac 616
Leu Tyr Ile Val Cys Arg Cys Gly Ser His Leu Trp Arg Glu Ser His
        -5                  1               5

cac tgagagcaag caatgatagc tgtggcggtt ttgcaaaaag aaaagggaga 669
His
10

caagcgccca gctatagtta ccaataaagc atggtactgg tattaaaata ggcatgtgtt 729

ctgttccaat ggaacagaat agagaaccca gaaacaaagc caaatattta cagccaactg 789

atctctgaca aagcaaacaa aaacataaag tggggaaagg acaccctatt ccacaaatag 849 

tgcagggata attggcaagc cacatgtaga aaaatgaagc tggatcctcg tctctcactt 909 

tatacaaaaa tcaactcaaa atgggtcaaa gtcttaactc taagacctga aaccataaca 969 

attctagaaa ataacattgg aaaaactctt ctagacattg gtttaggcaa aaagttcatg 1029 

accaagaacc caaaagcaaa tgcaataaaa aggaagataa atagatggga cctaattaag 1089 

ctgaaaagct tctgcatagc aaaaggaata atcagcagag caaacagaca acccacaggg 1149 

tgggagaaaa tatttgcaag ctatgtatct gacaatggac taatatccag aatctacaag 1209

gaattcaaac aattagcaag aaaaaacact tgtattgtgt ttgctctgta aatcagcaaa 1269 

aaaaaaa 1276 



<210> 47
<211> 747
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 206..745

<400> 47

='^;accagaagca ggtgatttcc gagctcagca atgctcagct cataatgatg tcaagcacca 60 
■j4ggccagttt tatgaatggc ttcctgtgtc taatgaccct gacaacccat gttcactcaa 120 
M^gtgccaagcc aaaggaacaa ccctggttgt tgaactagca cctaaggtct tagatggtac 180 
^%cgttgctat acagaatctt tggat atg tgc ate agt ggt tta tgc caa att 232 
Jl Met Cys lie Ser Gly Leu Cys Gin lie 

P 15 

j^0tt ggc tgc gat cac cag ctg gga age acc gtc aag gaa gat aac tgt 280 
if^al Gly Cys Asp His Gin Leu Gly Ser Thr Val Lys Glu Asp Asn Cys 
'^0 15 2 0 2 5 

gtc tgc aac gga gat ggg tec acc tgc egg ctg gtc cga ggg cag 32 8 

Iply Val Cys Asn Gly Asp Gly Ser Thr Cys Arg Leu Val Arg Gly Gin 
■f 30 35 40 

^'^%at aaa tec cag etc tec gea acc aaa teg gat gat act gtg gtt gea 376 
siryr Lys Ser Gin Leu Ser Ala Thr Lys Ser Asp Asp Thr Val Val Ala 
0 45 50 55 

:i;^tt ccc tat gga agt aga cat att cge ctt gtc tta aaa ggt ect gat 424 
He Pro Tyr Gly Ser Arg His He Arg Leu Val Leu Lys Gly Pro Asp 

60 65 70 

cac tta tat ctg gaa acc aaa ace etc cag ggg act aaa ggt gaa aac 472 
His Leu Tyr Leu Glu Thr Lys Thr Leu Gin Gly Thr Lys Gly Glu Asn 

75 80 85 

agt etc age tee aca gga 
Ser Leu Ser Ser Thr Gly 
90 95 
tte cag aaa ttt cea gae 
Phe Gin Lys Phe Pro Asp 
110 

aca gea gat tte att gtc 
Thr Ala Asp Phe He Val 
125 

aca gtc eag tte ate tte 
Thr Val Gin Phe He Phe 
140 

aeg gat tte ttt cct tgc 
Thr Asp Phe Phe Pro Cys 



act 


tte 


ctt 


gtg 


gae aat 


tet 


agt 


gtg 


gae 


520 


Thr 


Phe 


Leu 


Val 


Asp Asn 


Ser 


Ser 


Val 


Asp 












100 








105 




aaa 


gag 


ata 


ctg 


aga atg 


get 


gga 


eca 


etc 


568 


Lys 


Glu 


He 


Leu 


Arg Met 


Ala 


Gly Pro 


Leu 










115 








120 






aag 


att 


egt 


aae 


teg ggc 


tec 


get 


gae 


agt 


616 


Lys 


He 


Arg 


Asn 


Ser Gly 


Ser 


Ala Asp 


Ser 








130 








135 








tat 


caa 


ecc 


ate 


ate cac 


cga 


tgg 


agg 


gag 


664 


Tyr 


Gin 


Pro 


He 


He His 


Arg 


Trp 


Arg 


Glu 






145 








150 










tea 


gea 


ace 


tgt 


gga gga 


ggt 


tat 


eag 


ctg 


712 


Ser 


Ala 


Thr 


Cys 


Gly Gly 


Gly 


Tyr 


Gin 


Leu 
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155 160 165 

aca teg get gag tgc tac gat ctg agg age aac eg 747 
Thr Ser Ala Glu Cys Tyr Asp Leu Arg Ser Asn 
170 175 180 



<210> 48
<211> 561
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..521

<221> sig_peptide
<222> 36..104
<223> Von Heijne matrix
score 7.4
seq VLLLAALPPVLLP/GA

<221> polyA_signal
<222> 528..533

<221> polyA_site
<222> 548..561

<400> 48

gacgcctctt tcagcccggg atcgccccag caggg atg ggc gac aag atc tgg 53
                                       Met Gly Asp Lys Ile Trp
                                                   -20

    ccc ttc ccc gtg ctc ctt ctg gcc gct ctg cct ccg gtg ctg ctg 101
Leu Pro Phe Pro Val Leu Leu Leu Ala Ala Leu Pro Pro Val Leu Leu
        -15                 -10                 -5

cct ggg gcg gcc ggc ttc aca cct tcc ctc gat agc gac ttc acc ttt 149
Pro Gly Ala Ala Gly Phe Thr Pro Ser Leu Asp Ser Asp Phe Thr Phe
    1               5                   10                  15

acc ctt ccc gcc ggc cag aag gag tgc ttc tac cag ccc atg ccc ctg 197
Thr Leu Pro Ala Gly Gln Lys Glu Cys Phe Tyr Gln Pro Met Pro Leu
                20                  25                  30

aag gcc tcg ctg gag atc gag tac caa gtt tta gat gga gca gga tta 245
Lys Ala Ser Leu Glu Ile Glu Tyr Gln Val Leu Asp Gly Ala Gly Leu
            35                  40                  45

gat att gat ttc cat ctt gcc tct cca gaa ggc aaa acc tta gtt ttt 293
Asp Ile Asp Phe His Leu Ala Ser Pro Glu Gly Lys Thr Leu Val Phe
        50                  55                  60

gaa caa aga aaa tca gat gga gtt cac act gta gag act gaa gtt ggt 341
Glu Gln Arg Lys Ser Asp Gly Val His Thr Val Glu Thr Glu Val Gly
    65                  70                  75

gat tac atg ttc tgc ttt gac aat aca ttc agc acc att tct gag aag 389
Asp Tyr Met Phe Cys Phe Asp Asn Thr Phe Ser Thr Ile Ser Glu Lys
80                  85                  90                  95

gtg att ttc ttt gaa tta atc ccg gat aat atg gga gaa cag gca caa 437
Val Ile Phe Phe Glu Leu Ile Pro Asp Asn Met Gly Glu Gln Ala Gln
                100                 105                 110

gaa caa gaa gat tgg aag aaa tat att act ggc aca gat ata ttg gat 485
Glu Gln Glu Asp Trp Lys Lys Tyr Ile Thr Gly Thr Asp Ile Leu Asp
            115                 120                 125

atg aaa ctg gaa gac atc ctg gtc agt atg gtc ttc taataaaata 531
Met Lys Leu Glu Asp Ile Leu Val Ser Met Val Phe
        130                 135

aaaattatta acagccaaaa aaaaaaaaaa 561



<210> 49
<211> 632
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..395

<221> sig_peptide
<222> 36..104
<223> Von Heijne matrix
score 7.4
seq VLLLAALPPVLLP/GA

<221> polyA_signal
<222> 599..604

<221> polyA_site
<222> 619..632

<400> 49

gacgcctctt tcagcccggg atcgccccag caggg atg ggc gac aag atc tgg 53
                                       Met Gly Asp Lys Ile Trp
                                                   -20

hk:tg ccc ttc ccc gtg etc ctt ctg gee get ctg ect ccg gtg ctg ctg 101 
Leu Pro Phe Pro Val Leu Leu Leu Ala Ala Leu Pro Pro Val Leu Leu

cct ggg gcg gcc ggc ttc aca cct tcc ctc gat agc gac ttc acc ttt 149
Pro Gly Ala Ala Gly Phe Thr Pro Ser Leu Asp Ser Asp Phe Thr Phe
    1               5                   10                  15

acc ctt ccc gcc ggc cag aag gag tgc ttc tac cag ccc atg ccc ctg 197
Thr Leu Pro Ala Gly Gln Lys Glu Cys Phe Tyr Gln Pro Met Pro Leu
                20                  25                  30

aag gcc tcg ctg gag atc gag tac caa gtt tta gat gga gca gga tta 245
Lys Ala Ser Leu Glu Ile Glu Tyr Gln Val Leu Asp Gly Ala Gly Leu
            35                  40                  45

gat att gat ttc cat ctt gcc tct cca gaa ggc aaa acc tta gtt ttt 293
Asp Ile Asp Phe His Leu Ala Ser Pro Glu Gly Lys Thr Leu Val Phe
        50                  55                  60

gaa caa aga aaa tca gat gga gtt cac acg tgt ata aga agt aaa aat 341
Glu Gln Arg Lys Ser Asp Gly Val His Thr Cys Ile Arg Ser Lys Asn
    65                  70                  75

ggg cca ggc act gcg gtt cac gcc tat aat ccc agc act ttc cga ggc 389
Gly Pro Gly Thr Ala Val His Ala Tyr Asn Pro Ser Thr Phe Arg Gly
80                  85                  90                  95

caa gtg tagagactga agttggtgat tacatgttct gctttgacaa tacattcagc 445
Gln Val

accatttctg agaaggtgat tttctttgaa ttaatcctgg ataatatggg agaacaggca 505 

caaggacaag aagattggaa gaaatatatt actggcacag atatattgga tatgaaactg 565 

gaagacatcc tggtcagtat ggtcttctaa taaaataaaa attattaaca gccaaaaaaa 625 

aaaaaaa 632 



<210> 50
<211> 370
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..41

<221> polyA_signal
<222> 328..333

<221> polyA_site
<222> 357..370

<400> 50



ctgggacttc tggcctcaca atg gtt gag atg act ggg gtg tagcagtgcc 51
                      Met Val Glu Met Thr Gly Val

N'^agtcgaggc tgtgaaaggc cttccacctt tactctcgtg ctcgtgccct cccccattgt ill 

|5^^ggagaagg gcatgctcag gccagcccat tagcccagga ggaggacaag aaacacacgg 171 

agcagacaca agccacctca ccaacccagc caaggctgtc ctgaattagc aaccctgaca 231

j^fgtgtgagca agtccaacgg acaccggaag atccacctag tcaagcccaa ccaagactgg 2 91 

■'bagagctgcc aagctgacca cttaaggcgc atgaggaata aacactcgtt gctgcatgcc 351 

attgcaaaaa aaaaaaaaa 370 



<210> 51
<211> 994
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 35..631

<221> sig_peptide
<222> 35..160
<223> Von Heijne matrix
score 8.6
seq ASLFLLLSLTVFS/IV

<221> polyA_signal
<222> 901..906

<221> polyA_site
<222> 979..994

<400> 51


ataattggag ctgcaaagca gatcgtgaca agag atg gac ggt cag aag aaa aat 55 

                                      Met Asp Gly Gln Lys Lys Asn
                                              -40



tgg aag 


gac 


aag 


gtt 


att 


y «^ 




ouy 




tgg 


aga 


gac 


acc 


aag 


aag 


103 


Trp Lys 


Asp 


Lys 


Val 


Val 




T .oi 1 


Leu 


iyr 


Trp Arg Asp 


lie 


Lys 


Lys 




-35 








- n 










-25 










-20 




act gga gtg gtg 


ttt 


crcrt* 


gee 


ciy ^ 






Ctg 


ctg 


ctt 


tea 


ttg 


aca 


151 


Thr Gly Val Val 




vjjL y 




G £a V 

ocr 


Leu 


Fne 


Leu 


Leu 


Leu 


Ser 


Leu 


Thr 










J. ~J 










_ 1 n 
- J.U 










-5 






gta ttc 


age 


att 


y t-g 


ciy t_ 


yea 


aca 


gee 


■t- -3 
uac 


att 


gee 


ttg 


gee 


etg 


etc 


199 


Val Phe 


Ser 


He 
1 


Val 


Q or* 
OCX. 


V ciX 


Thr 


Til a 

Ax a 


Tyr 


He 


Ala 


Leu 


Ala 


Leu 


Leu 




tct gtg 


acc 


ate 


ciy o 


U U L. 


agg 


t- a 


uae 


aag 


ggt 


gtg 


10 
ate 


eaa 


get 


ate 


247 


Ser Val 


Thr 


He 


O^JL 




Arg 


±±e 


Tyr 


Lys 


Gly Val 


He 


Gin 


Ala 


He 




15 




















25 












cag aaa 


tea 


gat 


gaa 


ggc 


eae 


eca 


■1- 4- y-t 

uCC 


agg 


gca 


tat 


ctg 


gaa 


tct 


gaa 


295 


Gin Lys 


Ser 


Asp 


Lt±U 


o±y 


TT -J — 

nlS 


Pro 


Phe 


Arg 


Ala 


Tyr 


Leu 


Glu 


Ser 


Glu 




30 








35 










40 










45 




gtt get 


ata 


tct 


gag 


gag 


ttg 


gtt 


cag 


aag 


tac 


agt 


aat 


tct 


get 


ctt 


343 


Val Ala 


He 


Ser 


GlU 


Glu 


Leu 


Val 


Gin 


Lys 


Tyr 


Ser 


Asn 


Ser 


Ala 


Leu 










50 










55 










60 






,„, qgt cat 


gtg 


aac 


tgc 


aeg 


aca 


aag 


gaa 


etc 


agg 


cge 


etc 


ttc 


tta 


gtt 


391 


^^^ifGly His Val Asn 


Cys 


Thr 


He 


Lys 


Glu 


Leu 


Arg Arg 


Leu 


Phe 


Leu 


Val 








65 










70 










75 








i%a-t gat 


tta 


gtt 


gat 


ecu 


etg 


aag 


ttt 


gca 


gtg 


ttg 


atg 


tgg 


gta 


ttt 


439 


NiAsp Asp Leu Val 


Asp 


Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu 


Met 


Trp Val 


Phe 






80 










85 










90 










i;^cc tat 


gtt 


ggt 


gee 


t. og 


-f- 4- -1- 


aat 


ggt 


ctg 


aca 


eta 


Ctg 


att 


ttg 


get 


487 


isfThr Tyr Val 


Gly 


Ai.a 


Leu 


irne 


Asn Gly Leu 


Thr 


Leu 


Leu 


He 


Leu 


Ala 




ill 95 










100 










105 












btc att 


tea 


etc 


ttc 


agt 


gtt 


cet 


gtt 


att 


tat 


gaa 


egg 


cat 


cag 


gca 


535 


Leu lie 


Ser 


Leu 


Phe 


Ser 


Val 


Pro 


Val 


He 


Tyr 


Glu 


Arg 


His 


Gin 


Ala 




i:^>io 








115 










120 








125 




^^ifcag ata 


gat 


cat 


tat 


eta 


gta 


ctt 


gca 


aat 


aag 


aat 


gtt 


aaa 


gat 


get 


583 


r^<31n lie Asp His 


Tyr 


Leu 


Val 


Leu 


Ala 


Asn 


Lys 


Asn 


Val 


Lys Asp Ala 










130 










135 










140 






:||ptg get 


aaa 


ate 


eaa 


gca 


aaa 


ate 


cct 


gga 


ttg 


aag 


cge 


aaa 


get 


gaa 


631 


iy|1et Ala Lys 


He 


Gin 


Ala 


Lys 


He 


Pro Gly 


Leu 


Lys 


Arg 


Lys 


Ala 


Glu 








145 










150 










155 









tgaaaacgcc caaaataatt agtaggagtt catctttaaa ggggatattc atttgattat 691

acgggggagg gtcagggaag aacgaacctt gacgttgcag tgcagtttca cagatcgttg 751

ttagatcttt atttttagcc atgcactgtt gtgaggaaaa attacctgtc ttgactgcca 811

tgtgttcatc atcttaagta ttgtaagctg ctatgtatgg atttaaaccg taatcatatc 871

tttttcctat ctatctgagg cactggtgga ataaaaaacc tgtatatttt actttgttgc 931

agatagtctt gccgcatctt ggcaagttgc agagatggtg gagctagaaa aaaaaaaaac 991



<210> 52
<211> 412
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 271..399

<400> 52

gccgctagcg cctcgagcga tgcacctcct ttccaactgg gcaaaccccg cttccagcag 60 
acgtccttct atggccgctt caggcacttc ttggatatca tcgaccctcg cacactcttt 120
gtcactgaga gacgtctcag agaggctgtg cagctgctgg aggactataa gcatgggacc 180 
ctgcgcccgg gggtcaccaa tgaacagctc tggagtgcac agaaaatcaa gcaggctatt 240 
ctacatccgg acaccaatga gaagatcttc atg cca ttt aga atg tca ggt tat 294
                                 Met Pro Phe Arg Met Ser Gly Tyr
                                 1               5

att cct ttt ggg acg cca att gta agt gtt acc ttc aaa gga ttt cct 342
Ile Pro Phe Gly Thr Pro Ile Val Ser Val Thr Phe Lys Gly Phe Pro
    10                  15                  20

ttt cta aaa aat tat ttt aaa tgt cta act tta tgt tat tgc tca cgg 390
Phe Leu Lys Asn Tyr Phe Lys Cys Leu Thr Leu Cys Tyr Cys Ser Arg
25                  30                  35                  40

gta ttt gac tgaattgttg att 412
Val Phe Asp



<210> 53
<211> 597
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 103..252

<221> sig_peptide
<222> 103..213
<223> Von Heijne matrix
score 3.9
seq PGPSLRLFSGSQA/SV

<221> polyA_site
<222> 588..597

<400> 53



gaaaggtcag aggaaggagc tgtgggaagc tcgcagcagg tatcggagct taagccagtg 60
gatttggggg ccctgggctc cctagccggc tgcggtgtga ga atg gag tgg gca 114
                                               Met Glu Trp Ala
                                                       -35

gga aag cag cgg gac ttt cag gta agg gca gct ccg ggc tgg gat cat 162
Gly Lys Gln Arg Asp Phe Gln Val Arg Ala Ala Pro Gly Trp Asp His
                -30                 -25                 -20

ttg gcc tcc ttt cct ggc cct tct ctc cgg ctg ttt tct ggg agt cag 210
Leu Ala Ser Phe Pro Gly Pro Ser Leu Arg Leu Phe Ser Gly Ser Gln
            -15                 -10                 -5

gcg agt gtc tgt agt ctc tgc tcg ggg ttt ggg gct cag gaa 252
Ala Ser Val Cys Ser Leu Cys Ser Gly Phe Gly Ala Gln Glu
    1               5                   10

tgatgtcatg ctccaacagt tggattctat tagcttaagg aggagggaaa cagccaattt 312
tcttgacttt gcaaatctag ctgatctcac tcttgctgaa tctgaggtgt ttagacttca 372
ctctaaaaag catcatttta cttttattta gcacaaaggc acaggatatt tttacaggaa 432
gaatctttta tatggaaaaa tctgagttaa catcactccc gtggtgtttg tagttcttac 492
agggaaactc cagtgccttt tgagccgctt gttcgtccta gtgaacactg tctgttttgt 552
ctcttggtgc tgctatgtct gacctgtaat gggagaaaaa aagaa 597 



<210> 54
<211> 748
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..460

<221> polyA_signal
<222> 713..718

<221> polyA_site
<222> 735..748

<400> 54

c aca gtt cct ctc ctc cta gag cat gcc gac cat gcc cgc ggg cgt gcc 49
  Thr Val Pro Leu Leu Leu Glu Pro Ala Asp His Ala Arg Gly Arg Ala
  1               5                   10                  15



yiicat 


gtc 


eae 


eta 


cct 


gaa 


aat 


gtt 


cgc 


age 


cag 


tct 


cct 


ggc 


eat 


gtg 


97 


[=jHis 


Val 


His 


Leu 
20 


Pro 


Glu 


Asn 


Val 


Arg 
25 


Ser 


Gin 


Ser 


Pro 


Gly 
30 


His 


Val 




yicgc 


agg 


ggc 


aga 


agt 


ggt 


gca 


cag 


gta 


eta 


ceg 


ace 


gga 


cct 


gat 


gag 


145 




Arg 


Gly 


Arg 


Ser 


Gly 


Ala 


Gin 


Val 


Leu 


Pro 


Thr 


Gly 


Pro 


Asp 


Glu 








35 










40 










45 








jli?aaa 


cag 


gtt 


gag 


aag 


agt 


gaa 


gtt 


gat 


ttc 


tea 


aag 


tea 


cat 


age 


tta 


193 


' ' "Lys 


Gin 
50 


Val 


Glu 


Lys 


Ser 


Glu 
55 


Val 


Asp 


Phe 


Ser 


Lys 
60 


Ser 


His 


Ser 


Leu 




t'^tg 


aga 


cga 


ttt 


gag 


gat 


Ctg 


aag 


cee 


aag 


ctt 


tct 


gtt 


tgc 


aaa 


act 


241 




Arg 


Arg 


Phe 


Glu 


Asp 


Leu 


Lys 


Pro 


Lys 


Leu 


Ser 


Val 


Cys 


Lys 


Thr 














70 










75 






80 






tea 


eaa 


gtc 


ttt 


egg 


teg 


gag 


aae 


tgg 


aag 


gtc 


tgg 


gca 


gag 


teg 


289 




Ser 


Gin 


Val 


Phe 


Arg 


Ser 


Glu 


Asn 


Trp 


Lys 


Val 


Trp 


Ala 


Glu 


Ser 












85 










90 










95 






''age 


aga 


gga 


gac 


cat 


gat 


gac 


tgc 


eta 


gac 


ttg 


tgc 


tea 


gtg 


ctg 


tgt 


337 


Ser 


Arg 


Gly 


Asp 


His 


Asp 


Asp 


Cys 


Leu 


Asp 


Leu 


Cys 


Ser 


Val 


Leu 


Cys 










100 










105 










110 






tgg 


gga 


gaa 


ctg 


eta 


egg 


aca 


ata 


cct 


gaa 


att 


cea 


cea 


aag 


cgt 


gga 


385 


Trp 


Gly 


Glu 


Leu 


Leu 


Arg 


Thr 


He 


Pro 


Glu 


He 


Pro 


Pro 


Lys 


Arg 


Gly 








115 










120 










125 






gaa 


etc 


aaa 


aeg 


gag 


ctt 


ttg 


gga 


ctg 


aaa 


gaa 


aga 


aaa 


eac 


aaa 


cct 


433 


Glu 


Leu 


Lys 


Thr 


Glu 


Leu 


Leu 


Gly 


Leu 


Lys 


Glu 


Arg 


Lys 


His 


Lys 


Pro 






130 










135 










140 










eaa 


gtt 


tct 


eaa 


cag 


gag 


gaa 


ctt 


aaa 


taaetatgec aagaattctg 




480 


Gin 


Val 


Ser 


Gin 


Gin 


Glu 


Glu 


Leu 


Lys 


















145 










150 























tgaataatat aagtcttaaa tatgtatttc ttaatttatt gcatcaaact acttgtcctt 540 

aagcacttag tctaatgcta actgcaagag gaggtgctca gtggatgttt agccgatacg 600

ttgaaattta attacggttt gattgatatt tcttgaaaac cgccaaagca catatcatca 660

aaccatttca tgaatatggt ttggaagatg tttagtcttg aatataatgc gaaatagaat 720

atttgtaagt ctaccaaaaa aaaaaaaa 748
<210> 55
<211> 703
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 31..231

<221> polyA_signal
<222> 669..674

<221> polyA_site
<222> 690..703

<400> 55

ctctggtggc tctgctacgg cggcgcagaa atg agg cag aag cgg aaa gga gat 54
                                 Met Arg Gln Lys Arg Lys Gly Asp
                                 1               5

ctc agc cat gct aag ctg atg atg ctg act ata gga gat gtt att aaa 102
Leu Ser Pro Ala Lys Leu Met Met Leu Thr Ile Gly Asp Val Ile Lys
    10                  15                  20

caa ctg att gaa gcc cac gag cag ggg aaa gac atc gat cta aat aag 150
Gln Leu Ile Glu Ala His Glu Gln Gly Lys Asp Ile Asp Leu Asn Lys
25                  30                  35                  40

gtg aga acc aag aca gct gcc aaa tat ggc ctt tct gcc cag ccc cgc 198
Val Arg Thr Lys Thr Ala Ala Lys Tyr Gly Leu Ser Ala Gln Pro Arg
                45                  50                  55

n'ftg gtg gat ate att get gee gtc ect ect gag tagctgggat tacaggcacc 251 

Leu Val Asp Ile Ile Ala Ala Val Pro Pro Glu

|:;^cgecgetgcc aatttttgta tttttagtag ggatgggggt ttcaecatat tggtcaggct 311 
Mi^gtctcgaac tcetgaectc aggtgatcaa cccaccttgg ectccctaaa tgcegggatt 371 
ii^^%caggeatga geeaccgete egggectttg attttttaag gtggattttg gttgttataa 431 
^^.fetggagaaag gtaagagttc aagtteaacc cgtgtgtgaa ageaaaacaa tggaaaacag 491 
Ipattggettc ttcaaagget eetcttgtag aactgcetet ttgaaatttc gaggtaatet 551 
V^ctttggaga ctetgeetgg agagggtcag ttcetaagtt aaaageateg cttaaccttg 611 
gctcctgtgg cattttacaa aggtttaaag gaattgattc ctctgaaagg gcctgaaaat 671
aaaaagtctt taacatacaa aaaaaaaaaa aa 703



<210> 56
<211> 725
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 305..565

<221> polyA_signal
<222> 694..699

<221> polyA_site
<222> 713..725

<400> 56

ctcacggtgg tgaaggtcac agggttgcag cactcccagt agaccaggag ctccgggagg 60 

cagggccggc cccacgtcct ctgcgcacca ccctgagttg gatcctctgt gcgccacccc 120 

tgagttggat ccagggctag ctgctgttga cctccccact cccacgctgc cctcctgcct 180 

gcagccatga cgcccctgct caccctgatc ctggtggtcc tcatgggctt acctctggcc 240 

caggccttgg actgccacgt gtgaggacta caaatccctc caggatatca ttgccatcct 300 

gggt atg gat gaa ctt tct gag gaa gac aag ttg acc gtg tcc cgt gca 349
Met Asp Glu Leu Ser Glu Glu Asp Lys Leu Thr Val Ser Arg Ala
1 5 10 15

cgg aaa ata cag cgt ttc ttg tct cag cca ttc cag gtt gct gag gtc 397
Arg Lys Ile Gln Arg Phe Leu Ser Gln Pro Phe Gln Val Ala Glu Val
20 25 30

ttc aca ggt cat atg ggg aag ctg gta ccc ctg aag gag acc atc aaa 445
Phe Thr Gly His Met Gly Lys Leu Val Pro Leu Lys Glu Thr Ile Lys
35 40 45

gga ttc cag cag att ttg gca ggt gaa tat gac cat ctc cca gaa cag 493
Gly Phe Gln Gln Ile Leu Ala Gly Glu Tyr Asp His Leu Pro Glu Gln
50 55 60

gcc ttc tat atg gtg gga ccc att gaa gaa gct gtg gca aaa gct gat 541
Ala Phe Tyr Met Val Gly Pro Ile Glu Glu Ala Val Ala Lys Ala Asp
65 70 75

aag ctg gct gaa gag cat tca tcg tgaggggtct ttgtcctctg tactgtctct 595
Lys Leu Ala Glu Glu His Ser Ser
80 85

ctccttgccc ctaacccaaa aagcttcatt tttctgtgta ggctgcacaa gagccttgat 655
tgaagatata ttctttctga acagtattta aggtttccaa taaagtgtac acccctcaaa 715
aaaaaaaaaa 725



<210> 57
<211> 1705
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 124. .873

<221> sig_peptide
<222> 124. .378
<223> Von Heijne matrix
score 3.6

seq HLSWTLAAKVKC/IP

<221> polyA_signal
<222> 1673. .1678

<221> polyA_site
<222> 1694. .1705

<400> 57 

cggaggtgag gagcggcggc cccgcccggt gcgctggagg tcgaagcttc caggtagcgg 60
cccgcagagc ctgacccagg ctctggacat cctgagccca agtcccccac actcagtgca 120
gtg atg agt gcg gaa gtg aag gtg aca ggg cag aac cag gag caa ttt 168
Met Ser Ala Glu Val Lys Val Thr Gly Gln Asn Gln Glu Gln Phe
-85 -80 -75

ctg ctc cta gcc aag tcg gcc aag ggg gca gcg ctg gcc aca ctc atc 216
Leu Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu Ile
-70 -65 -60 -55

cat cag gtg ctg gag gcc cct ggt gtc tac gtg ttt gga gaa ctg ctg 264
His Gln Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu Leu Leu
-50 -45 -40

gac atg ccc aat gtt aga gag ctg naa gcc cgg aat ctt cct cca cta 312
Asp Met Pro Asn Val Arg Glu Leu Xaa Ala Arg Asn Leu Pro Pro Leu
-35 -30 -25

aca gag gct cag aag aat aag ctt cga cac ctc tca gtt gtc acc ctg 360
Thr Glu Ala Gln Lys Asn Lys Leu Arg His Leu Ser Val Val Thr Leu
-20 -15 -10

gct gct aaa gta aag tgt atc cca tat gca gtg ttg ctg gag gct ctt 408
Ala Ala Lys Val Lys Cys Ile Pro Tyr Ala Val Leu Leu Glu Ala Leu
-5 1 5 10

gcc ctg cgt aat gtg cgg cag ctg gaa gac ctt gtg att gag gct gtg 456
Ala Leu Arg Asn Val Arg Gln Leu Glu Asp Leu Val Ile Glu Ala Val
15 20 25

tat gct gac gtg ctt cgt ggc tcc ctg gac cag cgc aac cag cgg ctc 504
Tyr Ala Asp Val Leu Arg Gly Ser Leu Asp Gln Arg Asn Gln Arg Leu
30 35 40

gag gtt gac tac agc atc ggg cgg gac atc cag cgc cag gac ctc agt 552
Glu Val Asp Tyr Ser Ile Gly Arg Asp Ile Gln Arg Gln Asp Leu Ser
45 50 55

gcc att gcc cga acc ctg cag gaa tgg tgt gtg ggc tgt gag gtc gtg 600
Ala Ile Ala Arg Thr Leu Gln Glu Trp Cys Val Gly Cys Glu Val Val
60 65 70

UPtg tca ggc att gag gag cag gtg agc cgt gcc aac caa cac aag gag 648
Leu Ser Gly Ile Glu Glu Gln Val Ser Arg Ala Asn Gln His Lys Glu
75 80 85 90

cag cag ctg ggc ctg aag cag cag att gag agt gag gtt gcc aac ctt 696
Gln Gln Leu Gly Leu Lys Gln Gln Ile Glu Ser Glu Val Ala Asn Leu
95 100 105

aaa aaa acc att aaa gtt acg acg gca gca gca gcc gca gcc aca tct 744
Lys Lys Thr Ile Lys Val Thr Thr Ala Ala Ala Ala Ala Ala Thr Ser
110 115 120

cag gac cct gag caa cac ctg act gag ctg agg gaa cca gct cct ggc 792
Gln Asp Pro Glu Gln His Leu Thr Glu Leu Arg Glu Pro Ala Pro Gly
125 130 135

acc aac cag cgc cag ccc agc aag aaa gcc tca aag ggc aag ggg ctc 840
Thr Asn Gln Arg Gln Pro Ser Lys Lys Ala Ser Lys Gly Lys Gly Leu
140 145 150

cga ggg agc gcc aag att tgg tcc aag tcg aat tgaaagaact gtcgtttcct 893
Arg Gly Ser Ala Lys Ile Trp Ser Lys Ser Asn
155 160 165

ccctggggat gtggggtccc agctgcctgc ctgcctctta ggagtcctca gagagccttc 953
tgtgcccctg gccagctgat aatcctaggt tcatgaccct tcacctcccc taaccccaaa 1013
catagatcac accttctcta gggaggagtc aaatgtaggt catgtttttg ttggtacttt 1073
ctgttttttg tgacttcatg tgttccattg ctccccgctg ccatgctctc tcccttgttt 1133
ccttaagagc tcagcatctg tccctgttca ttacatgtca ttgagtaggt gggtagccct 1193
gatgggggtc gctctgtctg gagcataacc cacaggcgtt ttttctgcca ccccatccct 1253
gcatgcctga tccccagttc ctatacccta cccctgacct attgagcagc ctctgaagag 1313
ccatagggcc cccaccttta ctcacaccct gagaattctg ggagccagtc tgccatgcca 1373
ggagtcactg gacatgttca tcctagaatc ctgtcacact acagtcattt cttttcctct 1433
ctctggccct tgggtcctgg gaatgctgct gcttcaaccc cagagcctaa gaatggcagc 1493
cgtttcttaa catgttgaga gatgattctt tcttggccct ggccatctcg ggaagcttga 1553
tggcaatcct ggaagggttt aatctccttt tgtgagtttg gtggggaagg gaagggtata 1613
tagattatat taaaaaaaaa aaggtatata tgcatatatc tatatataat atgacgcaga 1673 
aataaatcta tgagaaatcc aaaaaaaaaa aa 1705 



<210> 58 

<211> 1069 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 135. .206 

<221> polyA_signal 
<222> 850. .855 

<221> polyA_site 
<222> 1056 . .1069 

<400> 58

W%;ccactccgc tctcacgact aagctctcac gattaaggca cgcctgcctc gattgtccag 60
cctctgccag aagaaagctt agcagccagc gcctcagtag agacctaagg gcgctgaatg 120
agtgggaaag ggaa atg ccg acc aat tgc gct gcg gcg ggc tgt gcc act 170
Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Thr
1 5 10

acc tac aac aag cac att aac atc agc ttc cac agg taacctgggc 216
Thr Tyr Asn Lys His Ile Asn Ile Ser Phe His Arg
15 20

agggagtggg ggtgacggaa actggagttc ctattgtggc tatcgcttgt gtggaaggaa 276
caggaggatt ctgctaattc taataacttt cccagctggt agcagggaag catcgtatgt 336
H^^^etttgtgtt tctcaaatct gcccaattgt tctctgcttt cggggaagct ttactcattt 396
tctaaaagaa atccaagtac tgtttggtca ttacccctta gtaaaaaaaa gtaacaggag 456
|%atatcgtaa ttttctactg ttttattcct ctgttagacc gggccttgac atgaatgacg 516
ccgtaaggga gaaagagatc ttcccaatca gcaatcaccg taaaagcctg ctgtgttccc 576
gttaaaatta ggaaattctc actagatgaa ttgacatggg aggcatttag atttctaata 636
rj^ptcaeatagt aattctgcgg aggaattgag tcatctttga tagccatgga attaagcgat 696
gttaattaaa gtgcaaacga taacctttct gttcttacta gaatagagta ataaaaagaa 756
cctaggtttt cttttgtttg ctggaagaaa aatcaaaatt ctttagttct gtcaaaccag 816
aactcttgaa agcactttga acaatgcctg gaaaataaca ggtactctgt aaatgtttac 876
cttctctgca agtgcctgcc acgtgcccga agaaaagaca cattaaaaag ttaagtgaca 936
ccagtcctga ttttatatat tttatatacc taacaacgta tatgttagta tgtagaaatt 996
atatccttga cctttttccc tacctattac gaactgtact tttattaaaa gctgccacta 1056
aaaaaaaaaa aaa 1069



<210> 59 

<211> 1084 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 135. .818

<221> polyA_signal
<222> 909. .914

<221> polyA_site
<222> 1071. .1084

<400> 59 

cccactccgc tctcacgact aagctctcac gattaaggca cgcctgcctc gattgtccag 60 
cctctgccag aagaaagctt agcagccagc gcctcagtag aggcctaagg gcgctgaatg 120
agtgggaaag ggaa atg ccg acc aat tgc gct gcg gcg ggc tgt gcc act 170
Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Thr
1 5 10

acc tac aac aag cac att aac atc agc ttc cac agg ttt cct ttg gat 218
Thr Tyr Asn Lys His Ile Asn Ile Ser Phe His Arg Phe Pro Leu Asp
15 20 25

cct aaa aga aga aaa gaa tgg gtt cgc ctg gtt agg cgc aaa aat ttt 266
Pro Lys Arg Arg Lys Glu Trp Val Arg Leu Val Arg Arg Lys Asn Phe
30 35 40

gtg cca gga aaa cac act ttt ctt tgt tca aag cac ttt gaa gcc tcc 314
Val Pro Gly Lys His Thr Phe Leu Cys Ser Lys His Phe Glu Ala Ser
45 50 55 60

tgt ttt gac cta aca gga caa act cga cga ctt aaa atg gat gct gtt 362
Cys Phe Asp Leu Thr Gly Gln Thr Arg Arg Leu Lys Met Asp Ala Val
65 70 75

cca acc att ttt gat ttt tgt acc cat ata aag tct atg aaa ctc aag 410
Pro Thr Ile Phe Asp Phe Cys Thr His Ile Lys Ser Met Lys Leu Lys
80 85 90

If ^99 aat ctt ttg aag aaa aac aac agt tgt tct cca gct gga cca 458
Ser Arg Asn Leu Leu Lys Lys Asn Asn Ser Cys Ser Pro Ala Gly Pro
95 100 105

tct agt tta aaa tca aac att agt agt cag caa gta cta ctt gaa cac 506
Ser Ser Leu Lys Ser Asn Ile Ser Ser Gln Gln Val Leu Leu Glu His
110 115 120

agc tat gcc ttt agg aat cct atg gag gca aaa aag agg atc att aaa 554
Ser Tyr Ala Phe Arg Asn Pro Met Glu Ala Lys Lys Arg Ile Ile Lys
125 130 135 140

||:;tg gaa aaa gaa ata gca agc tta aga aga aaa atg aaa act tgc cta 602
[illegible] Glu Lys Glu Ile Ala Ser Leu Arg Arg Lys Met Lys Thr Cys Leu
145 150 155

caa aag gaa cgc aga gca act cga aga tgg atc aaa gcc atg tgt ttg 650
Gln Lys Glu Arg Arg Ala Thr Arg Arg Trp Ile Lys Ala Met Cys Leu
160 165 170

gta aag aat tta gaa gca aat agt gta tta cct aaa ggt aca tca gaa 698
Val Lys Asn Leu Glu Ala Asn Ser Val Leu Pro Lys Gly Thr Ser Glu
175 180 185

cac atg tta cca act gcc tta agc agt ctt cct ttg gaa gat ttt aag 746
His Met Leu Pro Thr Ala Leu Ser Ser Leu Pro Leu Glu Asp Phe Lys
190 195 200

atc ctt gaa caa gat caa caa gat aaa aca ctg cta agt cta aat cta 794
Ile Leu Glu Gln Asp Gln Gln Asp Lys Thr Leu Leu Ser Leu Asn Leu
205 210 215 220

aaa cag acc aag agt acc ttc att taaatttagc ttgcacagag cttgatgcct 848
Lys Gln Thr Lys Ser Thr Phe Ile
225

atccttcatt cttttcagaa gtaaagataa ttatggcact tatgccaaaa ttcattattt 908
aataaagttt tacttgaagt aacattactg aatttgtgaa gacttgatta caaaagaata 968
aaaaacttca tatggaaatt ttatttgaaa atgagtggaa gcgccttaca ttagaattac 1028
ggacttaaaa attttgctaa taaattgtgt gtttgaaagg tgaaaaaaaa aaaaaa 1084



<210> 60 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33. .290 

<221> sig_peptide
<222> 33. .92 
<223> Von Heijne matrix 
score 5.4 

seq WFVHSSALGLVLA/PP 
<400> 60 

aatggtaggc cttcatgtga gccagttact ac atg aat ctt cat ttc cca cag 53 

Met Asn Leu His Phe Pro Gln
-20 -15

tgg ttt gtt cat tca tca gcg tta ggc ttg gtc ctg gct cca cct ttc 101
Trp Phe Val His Ser Ser Ala Leu Gly Leu Val Leu Ala Pro Pro Phe
-10 -5 1

tcc tct ccg ggc act gac ccc acc ttt ccg tgt att tac tgt agg cta 149
Ser Ser Pro Gly Thr Asp Pro Thr Phe Pro Cys Ile Tyr Cys Arg Leu
5 10 15

tta aat atg atc atg acc cgc ctt gca ttt tca ttc atc acc tgt tta 197
Leu Asn Met Ile Met Thr Arg Leu Ala Phe Ser Phe Ile Thr Cys Leu
20 25 30 35

tgc cca aat tta aag gaa gtt tgt ctc att ttg cca gaa aaa aat tgt 245
Cys Pro Asn Leu Lys Glu Val Cys Leu Ile Leu Pro Glu Lys Asn Cys
40 45 50

aat agt cgg cac gct gga ttt gta ggg cca gca aaa ttg cgg cag 290
Asn Ser Arg His Ala Gly Phe Val Gly Pro Ala Lys Leu Arg Gln
55 60 65

tgaaactagt ttcacttcta aagcccttca tttcccacaa ggttaagctc tcgaaacccc 350
atttgatcct tggttcctat ttcgatcctc ctttggaatc tgaaaatcgg tctccatgtt 410
gtatgcaaa 419



<210> 61 

<211> 682 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 485. .616 

<221> polyA_site 
<222> 669. .682 

<400> 61 



ctcctttctc attccttatc ttgcgtgttt ttaccttttt ttcataacta agtttttgag 60
gaagttagtg ttcttttcaa agaaccggtt cgaaatgtac ttttctttgc tactttttgt 120
tattttattg atcacatctt taatcttttg ttctctatac gtggcctgtt ttgatttatt 180
ttactattct tgctttctaa ggtaagtatt ttgttgtgta gtgctttatt tttttcatct 240
ttcttcttga ataataatga catttttagg ttataaattt tcctctggta ctcagtttgc 300
ctcattaatt ttggcagtaa gcattctcct tttattgctt tctatgtagt ctttaatttt 360
gcttttaact tcttctttga tctaaggatt acctacttgt taatttccaa atattatctt 420
atctatctat ctatctatct atctatctat ctatctatct acctatgtga gacgaagtct 480
ggct atg tcg ccg agg ctg gag tgc agt ggt gca atc ttg gct cac tgc 529
Met Ser Pro Arg Leu Glu Cys Ser Gly Ala Ile Leu Ala His Cys
1 5 10 15

aac ccc cgc ctc cca ggt tca agt tat tct cct gcc tca gct act tgg 577
Asn Pro Arg Leu Pro Gly Ser Ser Tyr Ser Pro Ala Ser Ala Thr Trp
20 25 30

gtg aga gga tcc ctt gag ccg ggg agg ttg agg ctg cag tgagccataa 626
Val Arg Gly Ser Leu Glu Pro Gly Arg Leu Arg Leu Gln
35 40

ccactactct ccagcctgga taacaaaagt gagactctga ccaaaaaaaa aaaaaa 682



<210> 62
<211> 1191
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 54. .995

<221> sig_peptide
<222> 54. .227
<223> Von Heijne matrix
score 4.1

seq LVHHCPTWQWATG/EE

<221> polyA_signal
<222> 1130. .1135

<221> polyA_site
<222> 1181. .1191

<400> 62 

cacggctgca ctttccatcc cgtcgcgggg ccggccgcta ctccggcccc agg atg 56
Met

cag aat gtg att aat act gtg aag gga aag gca ctg gaa gtg gct gag 104
Gln Asn Val Ile Asn Thr Val Lys Gly Lys Ala Leu Glu Val Ala Glu
-55 -50 -45

tac ctg acc ccg gtc ctc aag gaa tca aag ttt agg gaa aca ggt gta 152
Tyr Leu Thr Pro Val Leu Lys Glu Ser Lys Phe Arg Glu Thr Gly Val
-40 -35 -30

att acc cca gaa gag ttt gtg gca gct gga gat cac cta gtc cac cac 200
Ile Thr Pro Glu Glu Phe Val Ala Ala Gly Asp His Leu Val His His
-25 -20 -15 -10

tgt cca aca tgg caa tgg gct aca ggg gaa gaa ttg aaa gtg aag gca 248
Cys Pro Thr Trp Gln Trp Ala Thr Gly Glu Glu Leu Lys Val Lys Ala
-5 1 5

tac cta cca aca ggc aaa caa ttt ttg gta acc aaa aat gtg ccg tgc 296
Tyr Leu Pro Thr Gly Lys Gln Phe Leu Val Thr Lys Asn Val Pro Cys
10 15 20

tat aag cgg tgc aaa cag atg gaa tat tca gat gaa ttg gaa gct atc 344
Tyr Lys Arg Cys Lys Gln Met Glu Tyr Ser Asp Glu Leu Glu Ala Ile
25 30 35

att gaa gaa gat gat ggt gat ggc gga tgg gta gat aca tat cac aac 392
Ile Glu Glu Asp Asp Gly Asp Gly Gly Trp Val Asp Thr Tyr His Asn
40 45 50 55

aca ggt att aca gga ata acg gaa gcc gtt aaa gag atc aca ctg gaa 440
Thr Gly Ile Thr Gly Ile Thr Glu Ala Val Lys Glu Ile Thr Leu Glu
60 65 70

aat aag gac aat ata agg ctt caa gat tgc tca gca cta tgt gaa gag 488
Asn Lys Asp Asn Ile Arg Leu Gln Asp Cys Ser Ala Leu Cys Glu Glu
75 80 85

gaa gaa gat gaa gat gaa gga gaa gct gca gat atg gaa gaa tat gaa 536
Glu Glu Asp Glu Asp Glu Gly Glu Ala Ala Asp Met Glu Glu Tyr Glu
90 95 100

gag agt gga ttg ttg gaa aca gat gag gct acc cta gat aca agg aaa 584
Glu Ser Gly Leu Leu Glu Thr Asp Glu Ala Thr Leu Asp Thr Arg Lys
105 110 115

ata gta gaa gct tgt aaa gcc aaa act gat gct ggc ggt gaa gat gct 632
Ile Val Glu Ala Cys Lys Ala Lys Thr Asp Ala Gly Gly Glu Asp Ala
120 125 130 135

att ttg caa acc aga act tat gac ctt tac atc act tat gat aaa tat 680
Ile Leu Gln Thr Arg Thr Tyr Asp Leu Tyr Ile Thr Tyr Asp Lys Tyr
140 145 150

tac cag act cca cga tta tgg ttg ttt ggc tat gat gag caa cgg cag 728
Tyr Gln Thr Pro Arg Leu Trp Leu Phe Gly Tyr Asp Glu Gln Arg Gln
155 160 165

cct tta aca gtt gag cac atg tat gaa gac atc agt cag gat cat gtg 776
Pro Leu Thr Val Glu His Met Tyr Glu Asp Ile Ser Gln Asp His Val
170 175 180

aag aaa aca gtg acc att gaa aat cat cct cat ctg cca cca cct ccc 824
Lys Lys Thr Val Thr Ile Glu Asn His Pro His Leu Pro Pro Pro Pro
185 190 195

atg tgt tca gtt cac cca tgc agg cat gct gag gtg atg aag aaa atc 872
Met Cys Ser Val His Pro Cys Arg His Ala Glu Val Met Lys Lys Ile
200 205 210 215

att gag act gtt gca gaa gga ggg gga gaa ctt gga gtt cat atg tat 920
Ile Glu Thr Val Ala Glu Gly Gly Gly Glu Leu Gly Val His Met Tyr
220 225 230

ctt ctt att ttc ttg aaa ttt gta caa gct gtc att cca aca ata gaa 968
Leu Leu Ile Phe Leu Lys Phe Val Gln Ala Val Ile Pro Thr Ile Glu
235 240 245

tat gac tac aca aga cac ttc aca atg taatgaagag agcataaaat 1015
Tyr Asp Tyr Thr Arg His Phe Thr Met
250 255

ctatcctaat tattggttct gatttttaaa gaattaaccc atagatgtga ccattgacca 1075
tattcatcaa tatatacagt ttctctaata agggacttat atgtttatgc attaaataaa 1135
aatatgttcc actaccagcc ttacttgttt aataaaaatc agtgcaaaaa aaaaaa 1191



<210> 63 
<211> 1008 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 657. .923

<221> sig_peptide
<222> 657. .896
<223> Von Heijne matrix
score 3.5

seq RGLLSACAPWGDG/ST

<221> polyA_signal 
<222> 957. .962 

<221> polyA_site 
<222> 974. .1008 

<400> 63 

ntcgnatgtg gcacaaaacc cctctgctgg ctcatgtgtg caactgagac tgtcagagca 60 

tggctagctc tggggtccag ctctgctggg tgggggctag agaggaagca gggagtatct 120 

gcacacagga tgcctgcgct caggtggttg cagaagtcag tgcccaggcc cccccacaca 180

gtccccaaag gtccggcctc cccagcgcgg ggctcctcgt ttgaggggag gtgacttccc 240

tcccagcagg ctcttggaca cagtaagctt ccccagccct gcctgagcag cctttcctcc 300

ttgccctgtt ccccacctcc cggctccagt ccagggagct cccagggaag tggtcgaccc 360

|:^k;tccagtggc tgggccactc tgctagagtc catccgccaa gctgggggca tcggcaaggc 420

caagctgcgc agcatgaagg agcgaaagct ggagaagaag aagcagaagg agcaggagca 480

agtgagagcc acgagccaag gtgggcactt gatgtcggat ctcttcaaca agctggtcat 540

gaggcgcaag ggcatctctg ggaaagaacc tggggctggt gaggggcccg gaggagcctt 600

ll^gcccgcgtg tcagactcca tccctcctct gccgccaccg cagcagccac aggtag atg 659
Met
-80

[illegible] aca agg acg act ggg aat cct agg ggg ctc cat gac acc ttc ccc 707
Arg Thr Arg Thr Thr Gly Asn Pro Arg Gly Leu His Asp Thr Phe Pro
-75 -70 -65

cgc aga ccc aga ctt ggc cgt tgc tct gac atg gac aca gcc agg aca 755
Arg Arg Pro Arg Leu Gly Arg Cys Ser Asp Met Asp Thr Ala Arg Thr
-60 -55 -50

agc tgc tca gac ctg ctt ccc tgg gag ggg gtg acg gaa cca gca ctg 803
Ser Cys Ser Asp Leu Leu Pro Trp Glu Gly Val Thr Glu Pro Ala Leu
-45 -40 -35

tgt gga gac cag ctt caa gga acg gaa ggc tgg ctt gag gcc aca cag 851
Cys Gly Asp Gln Leu Gln Gly Thr Glu Gly Trp Leu Glu Ala Thr Gln
-30 -25 -20

ctg ggg cgg gga ctt ctg tct gcc tgt gct cca tgg ggg gac ggc tcc 899
Leu Gly Arg Gly Leu Leu Ser Ala Cys Ala Pro Trp Gly Asp Gly Ser
-15 -10 -5 1

acc cag cct gtg cca ctg tgt tct taagaggctt ccagagaaaa cggcacacca 953
Thr Gln Pro Val Pro Leu Cys Ser
5

atcaataaag aactgagcag aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaan 1008



<210> 64 
<211> 568 
<212> DNA 



<213> Homo sapiens

<220>
<221> CDS
<222> 18. .311

<221> sig_peptide
<222> 18. .62
<223> Von Heijne matrix
score 8.4

seq AMWLLCVALAVLA/WG
<400> 64 

agtgctgctt acccatc atg gaa gca atg tgg ctc ctg tgt gtg gcg ttg 50
Met Glu Ala Met Trp Leu Leu Cys Val Ala Leu
-15 -10 -5

gcg gtc ttg gca tgg ggc ttc ctc tgg gtt tgg gac tcc tca gaa cga 98
Ala Val Leu Ala Trp Gly Phe Leu Trp Val Trp Asp Ser Ser Glu Arg
1 5 10

atg aag agt cgg gag cag gga gga cgg ctg gga gcc gaa agc cgg acc 146
Met Lys Ser Arg Glu Gln Gly Gly Arg Leu Gly Ala Glu Ser Arg Thr
15 20 25

^;.^;;btg ctg gtc ata gcg cac cct gac gat gaa gcc atg ttt ttt gct ccc 194
Leu Leu Val Ile Ala His Pro Asp Asp Glu Ala Met Phe Phe Ala Pro
30 35 40

[illegible] gtg cta ggc ttg gcc cgc cta agg cac tgg gtg tac ctg ctt tgc 242
IvWhx Val Leu Gly Leu Ala Arg Leu Arg His Trp Val Tyr Leu Leu Cys
50 55 60

ttc tct gca gtt ttc cgt agg gag cta agt gaa tac acc gaa ggt ctt 290
Phe Ser Ala Val Phe Arg Arg Glu Leu Ser Glu Tyr Thr Glu Gly Leu
65 70 75

[illegible] tct gaa ccc ctc aca gcc tagggacagg agcggccggc ttacctggtg 341
[illegible] Ser Glu Pro Leu Thr Ala
80

^^^i^gttggggga cgtcggcagc tcgcgtacta cgccagcagg attgaggagc agagaaacag 401
^^^^,|:tgcagttgg ttgtattcag tacctgcatt tccgttggga actccacctg tacttgttat 461
;.i|:etgtggaae tttttttatt tgtagaagga gcaagaatat tgaccttact atatagcaca 521
0;p^^^^^^^^*^ tatgctgtat cgtgcctgct caatccttaa agttaac 568



<210> 65 

<211> 538 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 151. .426 

<221> sig_peptide 
<222> 151. .258 
<223> Von Heijne matrix 
score 5.2 

seq KVALAGLLGFGLG/KV 

<221> polyA_signal
<222> 505. .510

<221> polyA_site 
<222> 527. .538 

<400> 65 

cactgggtca aggagtaagc agaggataaa caactggaag gagagcaagc acaaagtcat 60 
catggcttca gcgtctgctc gtggaaacca agataaagat gcccattttc caccaccaag 120
caagcagctc tgcctttttc tcttgtaagc atg ctt gtc acc cag gga cta gtc 174
Met Leu Val Thr Gln Gly Leu Val
-35 -30

tac caa ggt tat ttg gca gct aat tct aga ttt gga tca ttg ccc aaa 222
Tyr Gln Gly Tyr Leu Ala Ala Asn Ser Arg Phe Gly Ser Leu Pro Lys
-25 -20 -15

gtt gca ctt gct ggt ctc ttg gga ttt ggc ctt gga aag gta tca tac 270
Val Ala Leu Ala Gly Leu Leu Gly Phe Gly Leu Gly Lys Val Ser Tyr
-10 -5 1

ata gga gta tgc cag agt aaa ttc cat ttt ttt gaa gat cag ctc cgt 318
Ile Gly Val Cys Gln Ser Lys Phe His Phe Phe Glu Asp Gln Leu Arg
5 10 15 20

ggg gct ggt ttt ggt cca cag cat aac agg cac tgc ctc ctt acc tgt 366
Gly Ala Gly Phe Gly Pro Gln His Asn Arg His Cys Leu Leu Thr Cys
25 30 35

gag gaa tgc aaa ata aag cat gga tta agt gag aag gga gac tct cag 414
Glu Glu Cys Lys Ile Lys His Gly Leu Ser Glu Lys Gly Asp Ser Gln
40 45 50

cct tca gct tcc taaattctgt gtctgtgact ttcgaagttt tttaaacctc 466
Pro Ser Ala Ser
55

tgaatttgta cacatttaaa atttcaagtg tactttaaaa taaaatactt ctaatggaac 526
aaaaaaaaaa aa 538



<210> 66
<211> 1747
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 10. .1062

<221> sig_peptide
<222> 10. .57
<223> Von Heijne matrix 
score 4.9 

seq FIYLQAHFTLCSG/WS 

<221> polyA_signal 
<222> 1710. .1715 

<221> polyA_site 
<222> 1735. .1747 

<400> 66 

gcctcacca atg gtt ccc ttc atc tat ctg caa gcc cac ttt aca ctc tgt 51
Met Val Pro Phe Ile Tyr Leu Gln Ala His Phe Thr Leu Cys
-15 -10 -5

tct ggg tgg tcc agc aca tac cgg gac ctc cgg aag ggt gtg tat gtg 99
Ser Gly Trp Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val
1 5 10

ccc tac acc cag ggc aag tgg gaa ggg gag ctg ggc acc gac ctg gta 147
Pro Tyr Thr Gln Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val
15 20 25 30

agc atc ccc cat ggc ccc aac gtc act gtg cgt gcc aac att gct gcc 195
Ser Ile Pro His Gly Pro Asn Val Thr Val Arg Ala Asn Ile Ala Ala
35 40 45

atc act gaa tca gac aag ttc ttc atc aac ggc tcc aac tgg gaa ggc 243
Ile Thr Glu Ser Asp Lys Phe Phe Ile Asn Gly Ser Asn Trp Glu Gly
50 55 60

atc ctg ggg ctg gcc tat gct gag att gcc agg cct gac gac tcc ccg 291
Ile Leu Gly Leu Ala Tyr Ala Glu Ile Ala Arg Pro Asp Asp Ser Pro
65 70 75

gag cct ttc ttt gac tct ctg gta aag cag acc cac gtt ccc aac ctc 339
Glu Pro Phe Phe Asp Ser Leu Val Lys Gln Thr His Val Pro Asn Leu
80 85 90

ttc tcc ctg cag ctt tgt ggt gct ggc ttc ccc ctc aac cag tct gaa 387
Phe Ser Leu Gln Leu Cys Gly Ala Gly Phe Pro Leu Asn Gln Ser Glu
100 105 110

gtg ctg gcc tct gtc gga ggg agc atg atc att gga ggt atc gac cac 435
Val Leu Ala Ser Val Gly Gly Ser Met Ile Ile Gly Gly Ile Asp His
115 120 125

tcg ctg tac aca ggc agt ctc tgg tat aca ccc atc cgg cgg gag tgg 483
Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro Ile Arg Arg Glu Trp
130 135 140

tat tat gag gtg atc att gtg cgg gtg gag atc aat gga cag gat ctg 531
Tyr Tyr Glu Val Ile Ile Val Arg Val Glu Ile Asn Gly Gln Asp Leu
145 150 155

aaa atg gac tgc aag gag tac aac tat gac aag agc att gtg gac agt 579
Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser Ile Val Asp Ser
160 165 170

ggc acc acc aac ctt cgt ttg ccc aag aaa gtg ttt gaa gct gca gtc 627
Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val
175 180 185 190

aaa tcc atc aag gca gcc tcc tcc acg gag aag ttc cct gac ggt ttc 675
Lys Ser Ile Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe
195 200 205

tgg cta gga gag cag ctg gtg tgc tgg caa gca ggc acc acc cct tgg 723
Trp Leu Gly Glu Gln Leu Val Cys Trp Gln Ala Gly Thr Thr Pro Trp
210 215 220

aac att ttc cca gtc atc tca ctc tac cta atg ggt gag gtt acc aac 771
Asn Ile Phe Pro Val Ile Ser Leu Tyr Leu Met Gly Glu Val Thr Asn
225 230 235

cag tcc ttc cgc atc acc atc ctt ccg cag caa tac ctg cgg cca gtg 819
Gln Ser Phe Arg Ile Thr Ile Leu Pro Gln Gln Tyr Leu Arg Pro Val
240 245 250

gaa gat gtg gcc acg tcc caa gac gac tgt tac aag ttt gcc atc tca 867
Glu Asp Val Ala Thr Ser Gln Asp Asp Cys Tyr Lys Phe Ala Ile Ser
255 260 265 270

cag tca tcc acg ggc act gtt atg gga gct gtt atc atg gag ggc ttc 915
Gln Ser Ser Thr Gly Thr Val Met Gly Ala Val Ile Met Glu Gly Phe
275 280 285

tac gtt gtc ttt gat cgg gcc cga aaa cga att ggc ttt gct gtc agc 963
Tyr Val Val Phe Asp Arg Ala Arg Lys Arg Ile Gly Phe Ala Val Ser
290 295 300

gct tgc cat gtg cac gat gag ttc agg acg gca gcg gtg gaa ggc ccn 1011
Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro
305 310 315

ttt tgt cac ctt gga cat gga aga ctg tgg cta caa cat tcc aca gac 1059
Phe Cys His Leu Gly His Gly Arg Leu Trp Leu Gln His Ser Thr Asp
320 325 330

aga tgagtcaacc ctcatgacca tagcctatgt catggctgcc atctgcgccc 1112
Arg
335

tcttcatgct gccactctgc ctcatggtgt gtcagtggcg ctgcctccgc tgcctgcgcc 1172 
agcagcatga tgactttgct gatgacatct ccctgctgaa gtgaggaggc ccatgggcag 1232 
aagataggga ttcccctgga ccacacctcc gtggttcact ttggtcacaa gtaggagaca 1292 
cagatggcac ctgtggccag agcacctcag gaccctcccc acccaccaaa tgcctctgcc 1352 
ttgatggaga aggaaaaggc tggcaaggtg ggttccaggg actgtacctg taggagacag 1412 
aaaagagaag aaagaagcac tctgctggcg ggaatactct tggtcacctc aaatttaagt 1472 
cgggaaattc tgctgcttga aacttcagcc ctgaaccttt gtcaccattc ctttaaattc 1532 
tccaacccaa agtattcttc ttttcttagt ttcagaagta ctggcatcac acgcaggtta 1592 
ccttggcgtg tgtccctgtg gtaccctggc agagaagaga ccaagcttgt ttccctgctg 1652 
gccaaagtca gtaggagagg atgcacagtt tgctatttgc tttagagaca gggactgtat 1712 
aaacaagcct aacattggtg caaaaaaaaa aaaaa 1747 



<210> 67
<211> 1686
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 78. .491

<221> sig_peptide
<222> 78. .218
<223> Von Heijne matrix
score 5.8

seq LMCFGALIGLCAC/IC

<221> polyA_signal 
<222> 1652. .1657 

<221> polyA_site 
<222> 1673. .1686 

<400> 67 

ggtatagccc accagaaagg acagagtcat ttgatgtggt cacaaaatgt gtgagtttca 60 
cactaactga gcagttc atg gag aaa ttt gtt gat ccc gga aac cac aat 110
Met Glu Lys Phe Val Asp Pro Gly Asn His Asn
-45 -40

agc ggg att gat ctc ctt agg acc tat ctt tgg cgt tgc cag ttc ctt 158
Ser Gly Ile Asp Leu Leu Arg Thr Tyr Leu Trp Arg Cys Gln Phe Leu
-35 -30 -25

tta cct ttt gtg agt tta ggt ttg atg tgc ttt ggg gct ttg atc gga 206
Leu Pro Phe Val Ser Leu Gly Leu Met Cys Phe Gly Ala Leu Ile Gly
-20 -15 -10 -5

ctt tgt gct tgc att tgc cga agc tta tat ccc acc att gcc acg ggc 254
Leu Cys Ala Cys Ile Cys Arg Ser Leu Tyr Pro Thr Ile Ala Thr Gly
1 5 10

att ctc cat ctc ctt gca ggt ctg tgt aca ctg ggc tca gta agt tgt 302
Ile Leu His Leu Leu Ala Gly Leu Cys Thr Leu Gly Ser Val Ser Cys
15 20 25

tat gtt gct gga att gaa cta ctc cac cag aaa cta gag ctc cct gac 350
Tyr Val Ala Gly Ile Glu Leu Leu His Gln Lys Leu Glu Leu Pro Asp
30 35 40

aat gta tcc ggt gaa ttt gga tgg tcc ttc tgc ctt gct tgt gtc tct 398
Asn Val Ser Gly Glu Phe Gly Trp Ser Phe Cys Leu Ala Cys Val Ser
45 50 55 60

gct ccc tta cag ttc atg gct tct gct ctc ttc atc tgg gct gct cac 446
Ala Pro Leu Gln Phe Met Ala Ser Ala Leu Phe Ile Trp Ala Ala His
65 70 75

acc aac cgg aga gag tac acc tta atg aag gca tat cgt gtg gca 491
Thr Asn Arg Arg Glu Tyr Thr Leu Met Lys Ala Tyr Arg Val Ala
80 85 90

tgagcaagaa actgcctgct ttacaattgc catttttatt tttttaaaat aatactgata 551
ttttccccac ctctcaattg tttttaattt ttatttgtgg atataccatt ttattatgaa 611
aatctatttt atttatacac attcaccact aaatacacac ttaataccac taaaatttat 671
gtggtttact ttaagcgatg ccatctttca aataaactaa tctaggtcta gacagaaaga 731
aatggataga gacttgacac aaatttatga aagaaaattg ggagtaggaa tgtgaccgaa 791
aacaagttgt gctaatgtct gttagacttt tcagtaaaac caaagtaact gtatctgttc 851
aactaaaaac tctatattag tttctttggg aaacctctca tcgtcaaaac tttatgttca 911
ctttgctgtt gtagatagcc agtcaaccag cagtattagt gctgttttca aagatttaag 971
ctctataaaa ttgggaaatt atctaagatc attttcccta agcattgaca catagcttca 1031
tctgaggtga gatatggcag ctgtttgtat ctgcactgtg tctgtctaca aagagtgaaa 1091
ypatacagtgt ttacttgaaa ttttaacttt gtaactgcaa gaattccagt tcggccgggc 1151
n^aggattagt attattttta actctccgta agattttcag taccaccaaa ttgttttgga 1211
ttttttttct ttcctcttca cataccaggg ttattaaaag tgtgctttct ttttacatta 1271
tattacagtt acaaggtaaa attcctcaac tgctatttat ttattccagc ccagtactat 1331
j;'"paagaaegtt tcaccataat gaccctccag agctgggaaa cctaccacaa gatctaaagt 1391
;i^;i:ctggctgte cattaacctc caactatggt ctttatttct tgtggtaata tgatgtgcct 1451
pH:tecttgcet aaatcccttc ctggtgtgta tcaacattat ttaatgtctt ctaattcagt 1511
^^battttttat aagtatgtct ataaacattg aactttaaaa aacttattta tttattccac 1571
tactgtagca attgacagat taaaaaaatg taacttcata atttcttacc ataacctcaa 1631
y|:gtetttttt aaaaaataaa attaaaaatg aaaagagacc caaaaaaaaa aaaaa 1686



<210> 68 

<211> 542 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. .371 

<221> sig_peptide
<222> 69. .287
<223> Von Heijne matrix
score 4

seq AVGFLFWVIVLTS/WI

<221> polyA_signal
<222> 510. .515

<221> polyA_site 
<222> 530. .542 

<400> 68 

tgttacttag ggtcaaggct tgggtcttgc cccgcaaacc cttgggacga cccggcccca 60 
gcgcagct atg aac ctg gag cga gtg tcc aat gag gag aaa ttg aac ctg 110
Met Asn Leu Glu Arg Val Ser Asn Glu Glu Lys Leu Asn Leu
-70 -65 -60

tgc cgg aag tac tac ctg ggg ggg ttt gct ttc ttg cct ttt ctc tgg 158
Cys Arg Lys Tyr Tyr Leu Gly Gly Phe Ala Phe Leu Pro Phe Leu Trp
-55 -50 -45

ttg gtc aac atc ttc tgg ttc tac cga gag gcc ttc ctt gtc cca gcc 206
Leu Val Asn Ile Phe Trp Phe Tyr Arg Glu Ala Phe Leu Val Pro Ala
-40 -35 -30

tac aca gaa cag agc caa atc aaa ggc tat gtc tgg cgc tca gct gtg 254
Tyr Thr Glu Gln Ser Gln Ile Lys Gly Tyr Val Trp Arg Ser Ala Val
-25 -20 -15

ggc ttc ctc ttc tgg gtg ata gtg ctc acc tcc tgg atc acc atc ttc 302
Gly Phe Leu Phe Trp Val Ile Val Leu Thr Ser Trp Ile Thr Ile Phe
-10 -5 1 5

[illegible] atc tac cgg ccc cgc tgg ggt gcc ctt ggg gac tac ctc tcc ttc 350
Gln Ile Tyr Arg Pro Arg Trp Gly Ala Leu Gly Asp Tyr Leu Ser Phe
10 15 20

[illegible] ata ccc ctg ggc acc ccc tgacaacttc tgcacatact ggggccctgc 401
[illegible] Ile Pro Leu Gly Thr Pro
25

ttattctccc aggacaggct ccttaaagca gaggagcctg tcctgggagc cccttctcaa 461
yS^etcctaaga cttgttctca tgtcccacgt tctctgctga catcccccaa taaaggaccc 521
taactttcaa aaaaaaaaaa a 542



<210> 69
<211> 1174
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..757

<221> sig_peptide
<222> 2..205
<223> Von Heijne matrix
score 7.3
seq LRLILSPLPGAQP/QQ

<221> polyA_site
<222> 1160..1174

<400> 69

g atg cct gag ggc ccc gag ctg cac ctg gcc agc cag ttt gtg aat gag 49
Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
-65 -60 -55

gcc tgc agg gcg ctg gtg ttc ggc ggc tgc gtg gag aag tcc tct gtc 97
Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
-50 -45 -40

agc cgc aac cct gag gtg ccc ttt gag agc agt gcc tac cgc [...] [...] 145
Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
-35 -30 -25

gct tca gcc cgc ggc aag [...] [...] [...] [...] ata ctg agc cct [...] [...] 193
Ala Ser Ala Arg Gly Lys [...] Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20 -15 -10 -5

ggg gcc cag cct caa cag [...] [...] [...] [...] ctg gtc ttc cgc [...] [...] 241
Gly Ala Gln Pro Gln Gln [...] Pro Leu Ala Leu Val Phe Arg [...] [...]
1 5 10

atg tcc ggc tct ttt cag ctg gtg ccc cgc gag gag ctg cca [...] [...] 289
Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro [...] [...]
15 20 25

gcc cac ctg cgc ttt tac acg gcc ccg cct ggc ccc cgg ctc gcc cta 337
Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
30 35 40

tgt ttc gtg gac atc cgc cgg ttc ggc cgc tgg gac ctt ggg gga aag 385
Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45 50 55 60

tgg cag ccg ggc cgc ggg ccc tgt gtc ttg cag gag tac cag cag ttc 433
Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
65 70 75

[...] gag aat gtg cta cga aac cta gcg gat aag gcc ttt gac cgg ccc 481
Arg Glu Asn Val Leu Arg Asn Leu Ala Asp Lys Ala Phe Asp Arg Pro
80 85 90

atc tgc gag gcc ctc ctg gac cag agg ttc ttc aat ggc att ggc aac 529
Ile Cys Glu Ala Leu Leu Asp Gln Arg Phe Phe Asn Gly Ile Gly Asn
95 100 105

tat ctg cgg gca gag atc ctg tac cgg ctg aag atc ccc ccc ttt gag 577
Tyr Leu Arg Ala Glu Ile Leu Tyr Arg Leu Lys Ile Pro Pro Phe Glu
110 115 120

aag gcc cgc tcg gtc ctg gag gcc ctg cag cag cac agg ccg [...] [...] 625
Lys Ala Arg Ser Val Leu Glu Ala Leu Gln Gln His Arg Pro Ser Pro
125 130 135 140

gag ctg acc ctg agc cag aag ata agg acc aag ctg cag aat tca gac 673
Glu Leu Thr Leu Ser Gln Lys Ile Arg Thr Lys Leu Gln Asn Ser Asp
145 150 155

ctg ctg gag cta tgt cac tca gtg ccc aag gaa gtg gtc cag ttg ggt 721
Leu Leu Glu Leu Cys His Ser Val Pro Lys Glu Val Val Gln Leu Gly
160 165 170

gag gcc aaa gat ggc agc aac ctc tgc ttc agc aaa tgattgtgta 767
Glu Ala Lys Asp Gly Ser Asn Leu Cys Phe Ser Lys
175 180

accctggggc acttgtcccc ctctggacct gattcaccga tttggaagtt tgtagcccta 827
gctgatactc aatggactag gcctcctcac ttgtcaatag tgtttccagg ctgggcgcag 887
tggctcatgc ctgtggtccc ggcacttcgg gaggccgagt ggggtggctc acctgaggtc 947
aggagttcga gaccatcctg gccaacatgg tgaaacccca tctccactaa aatgcaaaaa 1007
attagccagg tgtggtggcg ggcacctgta gtctcagcta ctcgggagga tgaggcagga 1067
aaatcgcttg aacccaggag gtggaggttg cagttgagct gagatcgtgc cattgcactc 1127
cagcctgggc aacgagagca aaactccatc tcaaaaaaaa aaaaaaa 1174



<210> 70
<211> 1285
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..1051

<221> sig_peptide
<222> 2..205
<223> Von Heijne matrix
score 7.3
seq LRLILSPLPGAQP/QQ

<221> polyA_signal
<222> 1248..1253

<221> polyA_site
<222> 1272..1285

<400> 70

g atg cct gag ggc ccc gag ctg cac ctg gcc agc cag ttt gtg aat gag 49
Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
-65 -60 -55

gcc tgc agg gcg ctg gtg ttc ggc ggc tgc gtg gag aag tcc tct gtc 97
Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
-50 -45 -40

agc cgc aac cct gag gtg ccc ttt gag agc agt gcc tac cgc atc tca 145
Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
-35 -30 -25

gct tca gcc cgc ggc aag gag ctg cgc ctg ata ctg agc cct ctg cct 193
Ala Ser Ala Arg Gly Lys Glu Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20 -15 -10 -5

ggg gcc cag ccc caa cag gag cca ctg gcc ctg gtc ttc cgc ttc ggc 241
Gly Ala Gln Pro Gln Gln Glu Pro Leu Ala Leu Val Phe Arg Phe Gly
1 5 10

atg tcc ggc tct ttt cag ctg gtg ccc cgc gag gag ctg cca cgc cat 289
Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro Arg His
15 20 25

[...] [...] [...] [...] [...] tac acg gcc ccg cct ggc ccc cgg ctc gcc cta 337
Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
30 35 40

tgt ttc gtg gac atc cgc cgg ttc ggc cgc tgg gac ctt ggg gga aag 385
Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45 50 55 60

tgg cag ccg ggc cgc ggg ccc tgt gtc ttg cag gag tac cag cag ttc 433
Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
65 70 75

agg ctg aag atc ccc ccc ttt gag aag gcc cgc tcg gtc ctg gag gcc 481
Arg Leu Lys Ile Pro Pro Phe Glu Lys Ala Arg Ser Val Leu Glu Ala
80 85 90

ctg cag cag cac agg ccg agc ccg gag ctg acc ctg agc cag aag ata 529
Leu Gln Gln His Arg Pro Ser Pro Glu Leu Thr Leu Ser Gln Lys Ile
95 100 105

agg acc aag ctg cag aat cca gac ctg ctg gag cta tgt cac tca gtg 577
Arg Thr Lys Leu Gln Asn Pro Asp Leu Leu Glu Leu Cys His Ser Val
110 115 120

ccc aag gaa gtg gac cag ttg ggg ggc agg ggc tac ggg tca gag agc 625
Pro Lys Glu Val Asp Gln Leu Gly Gly Arg Gly Tyr Gly Ser Glu Ser
125 130 135 140

ggg gag [...] gac ttt gct gcc ttt cga gcc [...] ctg cgc tgc [...] ggc 673
Gly Glu Glu Asp Phe Ala Ala Phe Arg Ala [...] Leu Arg Cys Tyr Gly
145 150 155

atg cca [...] atg agc tcc ctg cag gac [...] cat ggc cgt [...] [...] tgg 721
Met Pro Gly Met Ser Ser Leu Gln Asp [...] His Gly Arg Thr [...] Trp
160 165 170

ttc cag ggg gat cct [...] ccg ttg gca ccc aaa ggg cgc aag tcc cgc 769
Phe Gln Gly Asp Pro Gly Pro Leu Ala Pro Lys Gly Arg Lys Ser Arg
175 180 185

aaa aag aaa tcc aag gcc aca cag ctg [...] [...] gag gac aga gtg gag 817
Lys Lys Lys Ser Lys Ala Thr Gln Leu [...] Pro Glu Asp Arg Val Glu
190 195 200

gac gct ttg cct cca agc aag gcc cct [...] [...] aca cga agg gca aag 865
Asp Ala Leu Pro Pro Ser Lys Ala Pro Ser [...] Thr Arg Arg Ala Lys
205 210 215 220

aga gac ctt cct aag [...] act gca acc [...] cgg cct gag ggg acc agc 913
Arg Asp Leu Pro Lys [...] Thr Ala Thr [...] Arg Pro Glu Gly Thr Ser
225 230 235

ctc cag cag [...] cca [...] gct ccc aca gtg ccc aag aag ggg agg agg 961
Leu Gln Gln Asp Pro Glu Ala Pro Thr Val Pro Lys Lys Gly Arg Arg
240 245 250

aag ggg cga cag gca gcc tct ggc cac tgc aga ccc cgg aag gtc aag 1009
Lys Gly Arg Gln Ala Ala Ser Gly His Cys Arg Pro Arg Lys Val Lys
255 260 265

gct gac atc cca tcc ttg gaa cca gag ggg acc tca gcc tct 1051
Ala Asp Ile Pro Ser Leu Glu Pro Glu Gly Thr Ser Ala Ser
270 275 280

tagcaggagg ctctccttgc ttgcactcac cctttcttat tgtcttgccc tgcatctggg 1111
ggtctgaatt tttgggagca ggcaatatct gaaggtgcaa acaggcccta cggctgttcc 1171
ctgcacaact ctcatggttt taattgtacc ccatcttcca catctttaaa gctcatgtga 1231
aaaatgctgc atttttaata aactgataca tttgaactcc aaaaaaaaaa aaaa 1285



<210> 71
<211> 1398
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..1171

<221> sig_peptide
<222> 2..205
<223> Von Heijne matrix
score 7.3
seq LRLILSPLPGAQP/QQ

<221> polyA_signal
<222> 1368..1373

<221> polyA_site
<222> 1386..1398

<400> 71






g atg cct gag ggc ccc gag ctg cac ctg gcc agc cag ttt gtg aat gag 49
Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
-65 -60 -55

gcc tgc agg gcg ctg gtg ttc ggc ggc tgc gtg gag aag tcc tct gtc 97
Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
-50 -45 -40

agc cgc aac cct gag gtg ccc ttt gag agc agt gcc tac cgc atc tca 145
Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
-35 -30 -25

gct tca gcc cgc ggc aag gag ctg cgc ctg ata ctg agc cct ctg cct 193
Ala Ser Ala Arg Gly Lys Glu Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20 -15 -10 -5

ggg gcc cag ccc caa cag gag cca ctg gcc ctg gtc ttc cgc ttc ggc 241
Gly Ala Gln Pro Gln Gln Glu Pro Leu Ala Leu Val Phe Arg Phe Gly
1 5 10

atg tcc ggc tct ttt cag ctg gtg ccc cgc gag gag ctg cca cgc cat 289
Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro Arg His
15 20 25

gcc cac ctg cgc ttt tac acg gcc ccg cct ggc ccc cgg ctc gcc cta 337
Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
30 35 40

tgt ttc gtg gac atc cgc cgg ttc ggc cgc tgg gac ctt ggg gga aag 385
Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45 50 55 60

tgg cag ccg ggc cgc ggg ccc tgt gtc ttg cag gag tac cag cag ttc 433
Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
65 70 75

agg gag aat gtg cta cga aac cta gcg gat aag gcc ttt gac cgg ccc 481
Arg Glu Asn Val Leu Arg Asn Leu Ala Asp Lys Ala Phe Asp Arg Pro
80 85 90

atc tgc gag gcc ctc ctg gac cag agg ttc ttc aat ggc att ggc aac 529
Ile Cys Glu Ala Leu Leu Asp Gln Arg Phe Phe Asn Gly Ile Gly Asn
95 100 105

tat ctg cgg gca gag atc ctg tac cgg ctg aag atc ccc ccc ttt gag 577
Tyr Leu Arg Ala Glu Ile Leu Tyr Arg Leu Lys Ile Pro Pro Phe Glu
110 115 120

aag gcc cgc tcg gtc ctg gag gcc ctg cag cag cac agg ccg agc ccg 625
Lys Ala Arg Ser Val Leu Glu Ala Leu Gln Gln His Arg Pro Ser Pro
125 130 135 140

gag ctg acc ctg agc cag aag ata agg acc aag ctg cag aat cca gac 673
Glu Leu Thr Leu Ser Gln Lys Ile Arg Thr Lys Leu Gln Asn Pro Asp
145 150 155

ctg ctg gag cta tgt cac tca gtg ccc aag gaa gtg gtc cag ttg ggg 721
Leu Leu Glu Leu Cys His Ser Val Pro Lys Glu Val Val Gln Leu Gly
160 165 170

ggc aga ggc tac ggg tca gag agc ggg gag gag gac ttt gct gcc ttt 769
Gly Arg Gly Tyr Gly Ser Glu Ser Gly Glu Glu Asp Phe Ala Ala Phe
175 180 185

cga gcc tgg ctg cgc tgc tat ggc atg cca ggc atg agc tcc ctg cag 817
Arg Ala Trp Leu Arg Cys Tyr Gly Met Pro Gly Met Ser Ser Leu Gln
190 195 200

gac cgg cat ggc cgt acc atc tgg ttc cag ggg gat cct gga ccg ttg 865
Asp Arg His Gly Arg Thr Ile Trp Phe Gln Gly Asp Pro Gly Pro Leu
205 210 215 220

gca ccc aaa ggg cgc aag tcc cgc aaa aag aaa tcc aag gcc aca cag 913
Ala Pro Lys Gly Arg Lys Ser Arg Lys Lys Lys Ser Lys Ala Thr Gln
225 230 235

ctg agt cct gag gac aga gtg gag gac gct ttg cct ccg agc aag gcc 961
Leu Ser Pro Glu Asp Arg Val Glu Asp Ala Leu Pro Pro Ser Lys Ala
240 245 250

cct tcc agg aca cga agg gca aag aga gac ctt cct aag agg act gca 1009
Pro Ser Arg Thr Arg Arg Ala Lys Arg Asp Leu Pro Lys Arg Thr Ala
255 260 265

acc cag cgg cct gag ggg acc agc ctc cag cag gac cca gaa gct ccc 1057
Thr Gln Arg Pro Glu Gly Thr Ser Leu Gln Gln Asp Pro Glu Ala Pro
270 275 280

aca gtg ccc aag aag ggg agg agg aag ggg cga cag gca gcc tct ggc 1105
Thr Val Pro Lys Lys Gly Arg Arg Lys Gly Arg Gln Ala Ala Ser Gly
285 290 295 300

cac tgc aga ccc cgg aag gtc aag gct gac atc cca tcc ttg gaa cca 1153
His Cys Arg Pro Arg Lys Val Lys Ala Asp Ile Pro Ser Leu Glu Pro
305 310 315

gag ggg acc tca gcc tct tagcaggagg ctctccttgc ttgcactcac 1201
Glu Gly Thr Ser Ala Ser
320

cctttcttat tgtcttgccc tgcatctggg ggtctgaatt tttgggagca ggcaatatct 1261
gaaggtgcaa acaggcccta cggctgttcc ctgcacaact ctcatggttt taattgtacc 1321
ccatcttcca catctttaaa gctcatgtga aaaatgctgc atttttaata aactgataca 1381
tttgaaaaaa aaaaaaa 1398



<210> 72
<211> 821
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 42..611

<221> sig_peptide
<222> 42..287
<223> Von Heijne matrix
score 4.4
seq NLPHLQWGLTWG/HI

<221> polyA_signal
<222> 787..792

<221> polyA_site
<222> 808..821

<400> 72

ccgttgccag ttctgcgcgt gtcctgcgtc tccagtatgg a atg tat gtt tgg ccc 56
Met Tyr Val Trp Pro
-80

tgt gct gtg gtc ctg gcc cag tac ctt tgg ttt cac aga aga tct ctg 104
Cys Ala Val Val Leu Ala Gln Tyr Leu Trp Phe His Arg Arg Ser Leu
-75 -70 -65

cca ggc aag gcc atc tta gag att gga gca gga gtg agc ctt cca gga 152
Pro Gly Lys Ala Ile Leu Glu Ile Gly Ala Gly Val Ser Leu Pro Gly
-60 -55 -50

att ttg act gcc aaa tgt ggt gca gaa gta ata ctg tca gac agc tca 200
Ile Leu Thr Ala Lys Cys Gly Ala Glu Val Ile Leu Ser Asp Ser Ser
-45 -40 -35 -30




gaa 


ctg 


cct 


cac 


tqt 


ctq 


gaa 


gtc 


tat 


ccrcr 
^yy 


caa 


a err* 
dy 




^dd 


d L.y 


a 3 
ddC 




Glu 


Leu 


Pro 


His 


Cvs 


Leu 


Glu 


Val 


Cys 




Gin 


Ser 


y 


wXJll 




Asn 












-25 










-20 










- Xo 






aac 


ctcr 


cca 


cat 


ctcr 


cag 




crt" a 


yyci 


eta 


d^d 


t-gg 


ggt 


cat. 


3 -h 3 

aua 


UCu 


o o ^ 


Asn 


Leu 


Pro 


His 
-10 


Leu 


Gin 


Val 




- 5 


Leu 


ixir 


Trp 


oiy 


nis 


xxe 


Ser 




tCTQ 


crat 


ctt 


ctcr 


get 


eta 


cca 


cca 


caa 


^ d u 


d L. U 


sj +- ri 

dCC 


C C t 


X 

gca 


l-Cu 


gat 


344 


Trp 


Asp 


Leu 


Leu 


Ala 


Leu 


Pro 


Pro 


Gin 


Asp 


X xc 


X xc; 




AX a 


Ser 


Asp 






5 










10 






















ttc 


ttt 


gaa 


cca 


gaa 


gat 


ttt 


gaa 


gac 


a f- 1- 


M-rr 


get 


aca 


sa t- o 

aca 


cat 


•5 Q O 


Val 


Phe 


Phe 


Glu 


Pro 


Glu 


Asp 


Phe 


Glu 


Asp 


He 


Leu 


xaXd 


■L ILL. 


xxe 


Tyr 




20 










25 










■J u 














ttt 


ttg 


atq 


cac 


aag 


aat 


cec 


aag 


crtc 


caa 


1-tcr 




t" nf- 




tdC 


caa 


44U 


Phe 


Leu 


Met 


His 


Lys 


Asn 


Pro 


Lys 


Val 

V dX 






irp 


Ser 


Thr 


Tyr 


(jXn 












40 




















50 






gtt 


acrcr 


agt 


get 


gac 


fccrcr 


tea 


ctt 


gaa 


y ^ ^ 


i- h a 

u L.d 


etc 


tac 


aaa 


tgg 


gat 


488 


Val 


A^g 


Ser 


■cxjl d 




irp 


oer 


Leu 


(aXU 


Ala 


Leu 


Leu 


Tyr 


Lys 


Trp 


Asp 




















£r rv 
o u 










65 








atg 


aaa 


tgt 


gtc 


r~> ^ 


ci L. L. 


cct 




^ag 


tct 


ttt 


gat 


gca 


gac 


aaa 


gaa 


536 


Met 


Lys 


Cys 


Val 


His 




ir X. \j 


Leu 


OJ.U 


Ser 


Phe 


Asp 


Ala 


Asp 


Lys 


Glu 








70 










75 










80 










hgat 


ata 


gca 


gaa 


tct 


ace 


Ctt 


cca 


gga 


aga 


cat 


aca 


gtt 


gaa 


atg 


ctg 


584 




He 


Ala 


Glu 


Ser 


Thr 


Leu 


Pro 


Gly 


Arg 


His 


Thr 


Val 


Glu 


Met 


Leu 




•■h 


85 










90 










95 












att 


tec 


ttt 


gca 


aag 


gae 


agt 


etc 


tgaattatae ctacaaectg 




631 


p/al 


He 


Ser 


Phe 


Ala 


Lys 


Asp 


Ser 


Leu 



















hf^^ 105 

y^tctgggaca gtatcaatac tgatgagcaa cctggcacac aaactatgag cagaccactt 691

cagcttgaga atgcagtggg tctgaagatg gtcaagtctg tctgccttag attttgatgt 751

NiP^^^^^^^^^ acacttaaac tcatatgaaa caaaaattaa aatacgtatt acaagtaaaa 811

aaaaaaaaaa 821 



<210> 73
<211> 916
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 62..916

<221> sig_peptide
<222> 62..757
<223> Von Heijne matrix
score 4.2
seq LVTPAALRPLVLG/GN

<221> polyA_site
<222> 904..916

<400> 73

cctgaatgac ttgaatgttt ccccgcctga gctaacagtc catgtgggtg attcagctct 60
g atg gga tgt gtt ttc cag agc aca gaa gac aaa cgt ata ttc aag ata 109
Met Gly Cys Val Phe Gln Ser Thr Glu Asp Lys Arg Ile Phe Lys Ile
-230 -225 -220

gac tgg act ctg tca cca gga gag cac gcc aag gac gaa tat gtg cta 157
Asp Trp Thr Leu Ser Pro Gly Glu His Ala Lys Asp Glu Tyr Val Leu
-215 -210 -205

tac tat tac tcc aat ctc agt gtg cct att ggg cgc ttc cag aac cgc 205
Tyr Tyr Tyr Ser Asn Leu Ser Val Pro Ile Gly Arg Phe Gln Asn Arg
-200 -195 -190 -185

gta cac ttg atg ggg gac aac tta tgc aat gat ggc tct ctc ctg ctc 253
Val His Leu Met Gly Asp Asn Leu Cys Asn Asp Gly Ser Leu Leu Leu
-180 -175 -170

caa gat gtg caa gag gct gac cag gga acc tat atc tgt gaa atc cgc 301
Gln Asp Val Gln Glu Ala Asp Gln Gly Thr Tyr Ile Cys Glu Ile Arg
-165 -160 -155

ctc aaa ggg gag agc cag gtg ttc aag aag gcg gtg gta ctg cat gtg 349
Leu Lys Gly Glu Ser Gln Val Phe Lys Lys Ala Val Val Leu His Val
-150 -145 -140

ctt cca gag gag ccc aaa gag ctc atg gtc cat gtg ggt gga ttg att 397
Leu Pro Glu Glu Pro Lys Glu Leu Met Val His Val Gly Gly Leu Ile
-135 -130 -125

cag atg gga tgt gtt ttc cag agc aca gaa gtg aaa cac gtg acc aag 445
Gln Met Gly Cys Val Phe Gln Ser Thr Glu Val Lys His Val Thr Lys
-120 -115 -110 -105

gta gaa tgg ata ttt tca gga cgg cgc gca aag gag gag att gta ttt 493
Val Glu Trp Ile Phe Ser Gly Arg Arg Ala Lys Glu Glu Ile Val Phe
-100 -95 -90

cgt tac tac cac aaa ctc agg atg tct gcg gag tac tcc cag agc tgg 541
Arg Tyr Tyr His Lys Leu Arg Met Ser Ala Glu Tyr Ser Gln Ser Trp
-85 -80 -75

ggc cac ttc cag aat cgt gtg aac ctg gtg ggg gac att ttc cgc aat 589
Gly His Phe Gln Asn Arg Val Asn Leu Val Gly Asp Ile Phe Arg Asn
-70 -65 -60

gac ggt tcc atc atg ctt caa gga gtg agg gag tca gat gga gga aac 637
Asp Gly Ser Ile Met Leu Gln Gly Val Arg Glu Ser Asp Gly Gly Asn
-55 -50 -45

tac acc tgc agt atc cac cta ggg aac ctg gtg ttc aag aaa acc att 685
Tyr Thr Cys Ser Ile His Leu Gly Asn Leu Val Phe Lys Lys Thr Ile
-40 -35 -30 -25

gtg ctg cat gtc agc ccg gaa gag cct cga aca ctg gtg acc ccg gca 733
Val Leu His Val Ser Pro Glu Glu Pro Arg Thr Leu Val Thr Pro Ala
-20 -15 -10

gcc ctg agg cct ctg gtc ttg ggt ggt aat cag ttg gtg atc att gtg 781
Ala Leu Arg Pro Leu Val Leu Gly Gly Asn Gln Leu Val Ile Ile Val
-5 1 5

gga att gtc tgt gcc aca atc ctg ctg ctc cct gtc ctg ata ttg atc 829
Gly Ile Val Cys Ala Thr Ile Leu Leu Leu Pro Val Leu Ile Leu Ile
10 15 20

gtg aag aag acc tgt gga aat aag agt tca gtg aat tct aca gtc ttg 877
Val Lys Lys Thr Cys Gly Asn Lys Ser Ser Val Asn Ser Thr Val Leu
25 30 35 40

gtg aag aac acg aag aag act aat cca aaa aaa aaa aaa 916
Val Lys Asn Thr Lys Lys Thr Asn Pro Lys Lys Lys Lys
45 50



<210> 74
<211> 1153
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 62..520

<221> polyA_signal
<222> 1124..1129

<221> polyA_site
<222> 1141..1153

<400> 74



cctgaatgac ttgaatgttt ccccgcctga gctaacagtc catgtgggtg attcagctct 60
g atg gga tgt gtt ttc cag agc aca gta gac aaa tgt ata ttc aag ata 109
Met Gly Cys Val Phe Gln Ser Thr Val Asp Lys Cys Ile Phe Lys Ile
1 5 10 15

gac tgg act ctg tca cca gga gag cac gcc aag gac gaa tat gtg cta 157
Asp Trp Thr Leu Ser Pro Gly Glu His Ala Lys Asp Glu Tyr Val Leu
20 25 30

tac tat tac tcc aat ctc agt gtg cct att ggg cgc ttc cag aac cgc 205
Tyr Tyr Tyr Ser Asn Leu Ser Val Pro Ile Gly Arg Phe Gln Asn Arg
35 40 45

[...] cac ttg atg ggg gac atc tta tgc aat gat ggc tct ctc ctg ctc 253
Val His Leu Met Gly Asp Ile Leu Cys Asn Asp Gly Ser Leu Leu Leu
50 55 60

[...] gat gtg caa gag gct gac cag gga acc tat atc tgt gaa atc cgc 301
Gln Asp Val Gln Glu Ala Asp Gln Gly Thr Tyr Ile Cys Glu Ile Arg
65 70 75 80

ctc aaa ggg gag agc cag gtg ttc aag aag gcg gtg gta ctg cat gtg 349
Leu Lys Gly Glu Ser Gln Val Phe Lys Lys Ala Val Val Leu His Val
85 90 95

[...] cca gag gag ccc aaa gag ctc atg gtc cat gtg ggt gga ttg att 397
[...] Pro Glu Glu Pro Lys Glu Leu Met Val His Val Gly Gly Leu Ile
100 105 110

[...] atg gga tgt gtt ttc cag agc aca gaa gtg aaa cac gtg acc aag 445
[...] Met Gly Cys Val Phe Gln Ser Thr Glu Val Lys His Val Thr Lys
115 120 125

gta gaa tgg ata ttt tca gga cgg cgc gca aag gta aca agg agg aaa 493
Val Glu Trp Ile Phe Ser Gly Arg Arg Ala Lys Val Thr Arg Arg Lys
130 135 140

cat cac tgt gtt aga gaa ggc tct ggc tgatggtatc aggacaaagg 540
His His Cys Val Arg Glu Gly Ser Gly
145 150

tagaatcagg cacatgagga ggtgttgcaa gagcctgggc tttggtgctt atcagaactg 600
gaccttctcc tagcaatttc agctttctgg tgggaaaggt aactccaatg aagaacaaga 660
acaagaagat gatgatgatg cttaactttt tggatgccga tatgagattg tacatgtaaa 720
gcattttgta taagacttgg cccctgcatt ttagtttcct tctttctccc ttttccttcg 780
tatagagtcc atgggagaat gagggagatg atttttgtgg cccagccaag aaagcaatgg 840
gctagacatt aaaatgatta cacttttatt cttactgggg ttagttctgt gagttttcat 900
ctgtgcccca ttgccccatt tatgtgatgg agggaatttt catgggtact tcacgtgttg 960
ggattgattg atcctggggg ccagggtgaa gggtatttta cgggacctct ataaagcagg 1020
aagaagcaag tttattcttt agaccagtag ctctcaacca tgatgtggtc atatatttat 1080
gggtcaacat gtgttgtggg gatatcccaa gtaacttgtt attaataaaa gttaagttgc 1140
aaaaaaaaaa aaa 1153






<210> 75
<211> 1517
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..167

<400> 75

ctctgaaatg cttgtctttt atg ctg gna ggt gac cat agg gct ctg ctt tta 53
Met Leu Xaa Gly Asp His Arg Ala Leu Leu Leu
1 5 10

aag ata tgg ctg ctt caa agg cca gag tca cag gaa gga ctt ctt cca 101
Lys Ile Trp Leu Leu Gln Arg Pro Glu Ser Gln Glu Gly Leu Leu Pro
15 20 25

ggg aga tta gtg gtg atg gag agg aga gtt aaa atg acc tca tgt cct 149
Gly Arg Leu Val Val Met Glu Arg Arg Val Lys Met Thr Ser Cys Pro
30 35 40

tct tgt cca cgg ttt tgt tgagttttca ctcttctaat gcaagggtct 197
Ser Cys Pro Arg Phe Cys
45

cacactgtga accacttagg atgtgatcac tttcaggtgg ccaggaatgt tgaatgtctt 257
ifl^ggctcagtt catttaaaaa agatatctat ttgaaagttc tcagagttgt acatatgttt 317
cacagtacag gatctgtaca taaaagtttc tttcctaaac cattcaccaa gagccaatat 377
iJptaggcattt tcttggtagc acaaattttc ttattgctta gaaaattgtc ctccttgtta 437
y^ttctgtttg taagacttaa gtgagttagg tctttaagga aagcaacgct cctctgaaat 497
gcttgtcttt tatgctggga ggtgaccata gggctctgct tttaaagata tggctgcttc 557
aaaggccaga gtcacaggaa ggacttcttc cagggagatt agtggtgatg gagaggagag 617
ii:;^^taaaatgac ctcatgtcct tcttgtccac ggttttgttg agttttcact cttctaatgc 677
U.kagggtctca cactgtgaac cacttaggat gtgatcactt tcaggtggcc aggaatgttg 737
^^atgtctttg gctcagttca tttaaaaaag atatctattt gaaagttctc agagttgtac 797
-..ptatgtttca cagtacagga tctgtacata aaagtttctt tcctaaacca ttcaccaaga 857
ypccaatatct aggcattttc ttggtagcac aaattttctt attgcttaga aaattgtcct 917
yjpettgttatt tctgtttgta agacttaagt gagttaggtc tttaaggaaa gcaacgctcc 977
'^^etgaaatgc ttgtctttna tgctgggagg tgaccatagg gctctgcttt taaagatatg 1037
gctgcttcaa aggccagagt cacaggaagg acttcttcca gggagattag tggtgatgga 1097
gaggagagtt aaaatgacct catgtccttc ttgtccacgg ttttgttgag ttttcactct 1157
tctaatgcaa gggtctcaca ctgtgaacca cttaggatgt gatcactttc aggtggccag 1217
gaatgttgaa tgtctttggc tcagttcatt taaaaaagat atctatttga aagttctcag 1277
agttgtacat atgtttcaca gtacaggatc tgtacataaa agtttctttc ctaaaccatt 1337
caccaagagc caatatctag gcattttctt ggtagcacaa attttcttat tgcttagaaa 1397
attgtcctcc ttgttatttc tgtttgtaag acttaagtga gttaggtctt taaggaaagc 1457
aacgctcctc tgaaatgctt gtcttttatg ctgggaggtg accatagggc tctgctttta 1517



<210> 76
<211> 526
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 22..318

<221> sig_peptide
<222> 22..93
<223> Von Heijne matrix
score 4.6
seq FFIFCSLNTLLLG/GV

<221> polyA_signal
<222> 497..502

<221> polyA_site
<222> 516..526

<400> 76



ctgcctgctg cttgctgcac c atg aag tct gcc aag ctg gga ttt ctt cta 51
Met Lys Ser Ala Lys Leu Gly Phe Leu Leu
-20 -15

aga ttc ttc atc ttc tgc tca ttg aat acc ctg tta ttg ggt ggt gtt 99
Arg Phe Phe Ile Phe Cys Ser Leu Asn Thr Leu Leu Leu Gly Gly Val
-10 -5 1

aat aaa att gcg gag aag ata tgt gga gac ctc aaa gat ccc tgc aaa 147
Asn Lys Ile Ala Glu Lys Ile Cys Gly Asp Leu Lys Asp Pro Cys Lys
5 10 15

[...] gac atg aat ttt gga agc tgc tat gaa gtt cac ttt aga tat ttc 195
[...] Asp Met Asn Phe Gly Ser Cys Tyr Glu Val His Phe Arg Tyr Phe
20 25 30

tac aac aga acc tcc aaa aga tgt gaa act ttt gtc ttc tcc ggc tgt 243
Tyr Asn Arg Thr Ser Lys Arg Cys Glu Thr Phe Val Phe Ser Gly Cys
35 40 45 50

aat ggc aac ctt aac aac ttc aag ctt aaa ata gaa cgt gaa gta gcc 291
Asn Gly Asn Leu Asn Asn Phe Lys Leu Lys Ile Glu Arg Glu Val Ala
55 60 65

tgt gtt gca aaa tac aaa cca ccg agg tgagaggatg tgaactcatg 338
Cys Val Ala Lys Tyr Lys Pro Pro Arg
70 75

^j^agttgtetg ctgcaccatc cgaaataaag acacaagaaa attcagactg attttgaaat 398
ctttgtaata tttccataat gctttaagct tccatatgtt tgctattttc ctgaccctag 458
'•'■^tttgtcttt cctggaaatt aactgtatga tcattagaat gaaagagtct ttctgtcaaa 518
aaaaaaaa 526



<210> 77
<211> 352
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 8..292

<221> sig_peptide
<222> 8..118
<223> Von Heijne matrix
score 5.6
seq WLLLDALLRLGDT/KK

<221> polyA_signal
<222> 317..322

<221> polyA_site
<222> 339..352

<400> 77



ctgagat atg gca agt ccc gct gta aac agg tgg aaa [...] cca agg ttg 49
Met Ala Ser Pro Ala Val Asn Arg Trp Lys [...] Pro Arg Leu
-35 -30 -25

aag ccg gtg tgg cca cgg cgc [...] gaa tcc tgg ttg ttg ctg gat gct 97
Lys Pro Val Trp Pro Arg Arg Leu Glu Ser Trp Leu Leu Leu Asp Ala
-20 -15 -10

ctt ttg cga tta gga gat acc aaa aaa aag cga cag cct gaa gca gcc 145
Leu Leu Arg Leu Gly Asp Thr Lys Lys Lys Arg Gln Pro Glu Ala Ala
-5 1 5

aca aaa tcc tgt gtt aga agc agc tgt ggg ggt ccc agt gga gat ggg 193
Thr Lys Ser Cys Val Arg Ser Ser Cys Gly Gly Pro Ser Gly Asp Gly
10 15 20 25

cct ccc cca tgc ctc cag cag cct gac cct cgt gcc ctg tct cag gcg 241
Pro Pro Pro Cys Leu Gln Gln Pro Asp Pro Arg Ala Leu Ser Gln Ala
30 35 40

ttc tct aga tcc ttt cct ctg ttt ccc tct ctc gct ggc aaa agt atg 289
Phe Ser Arg Ser Phe Pro Leu Phe Pro Ser Leu Ala Gly Lys Ser Met
45 50 55

atc taattgaaac aagactgaag gatcaataaa cagccatctg ccccttcaaa 342
Ile

aaaaaaaaaa 352



<210> 78
<211> 542
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 16..378

<221> sig_peptide
<222> 16..84
<223> Von Heijne matrix
score 9.8
seq FLLFFFLFLLTRG/SL

<221> polyA_signal
<222> 502..507

<221> polyA_site
<222> 522..542

<400> 78

cacgacctgt gggcc atg atg cta ccc caa tgg ctg ctg ctg ctg ttc ctt 51
Met Met Leu Pro Gln Trp Leu Leu Leu Leu Phe Leu
-20 -15

ctc ttc ttc ttt ctc ttc ctc ctc acc agg ggc tca ctt tct cca aca 99
Leu Phe Phe Phe Leu Phe Leu Leu Thr Arg Gly Ser Leu Ser Pro Thr
-10 -5 1 5

aaa tat aac ctt ttg gag ctc aag gag tct tgc atc cgg aac cag gac 147
Lys Tyr Asn Leu Leu Glu Leu Lys Glu Ser Cys Ile Arg Asn Gln Asp
10 15 20

tgc gag act ggc tgc tgc caa cgt gct cca gac aat tgc gag tcg cac 195
Cys Glu Thr Gly Cys Cys Gln Arg Ala Pro Asp Asn Cys Glu Ser His
25 30 35

tgc gcg gag aag ggg tcc gag ggc agt ctg tgt caa acg cag gtg ttc 243
Cys Ala Glu Lys Gly Ser Glu Gly Ser Leu Cys Gln Thr Gln Val Phe
40 45 50

ttt ggc caa tat aga gcg tgt ccc tgc ctg cgg aac ctg act tgt ata 291
Phe Gly Gln Tyr Arg Ala Cys Pro Cys Leu Arg Asn Leu Thr Cys Ile
55 60 65

tat tca aag aat gag aaa tgg ctt agc atc gcc tat ggc cgt tgt cag 339
Tyr Ser Lys Asn Glu Lys Trp Leu Ser Ile Ala Tyr Gly Arg Cys Gln
70 75 80 85

aaa att gga agg cag aag ttg gct aag aaa atg ttc ttc tagtgctccc 388
Lys Ile Gly Arg Gln Lys Leu Ala Lys Lys Met Phe Phe
90 95

tccttcttgc tgcctcctcc tcctccacct gctctcctcc ctacccagag ctctgtgttc 448
accctgttcc ccagagcctc caccatgagt ggagggaagt ggggagtgat tgaaataaag 508
agctttttca atgaaaaaaa aaaaaaaaaa aaaa 542



<210> 79
<211> 233
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 57..233

<400> 79

i^pcaaaaccaa aaccagcacc gatcccgaca tagatcagtg acgtcttttt cttcag atg 59
Met
1

atc cta tgt ttc ctt ctt cct cat cat cgt ctt cag gaa gcc aga cag 107
Ile Leu Cys Phe Leu Leu Pro His His Arg Leu Gln Glu Ala Arg Gln
5 10 15

att caa gta ttg aag atg ctg cca agg gaa aaa tta aga aga aga gaa 155
Ile Gln Val Leu Lys Met Leu Pro Arg Glu Lys Leu Arg Arg Arg Glu
20 25 30

gag aga aaa caa ata aat ggg aaa aaa gaa agg aca aaa tat gaa aca 203
Glu Arg Lys Gln Ile Asn Gly Lys Lys Glu Arg Thr Lys Tyr Glu Thr
35 40 45

cca aga aaa aga gaa gga aaa aaa aaa aaa 233
Pro Arg Lys Arg Glu Gly Lys Lys Lys Lys
50 55



<210> 80
<211> 660
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 83..340

<221> sig_peptide
<222> 83..124
<223> Von Heijne matrix
score 7.5
seq VALNLILVPCCAA/WC

<221> polyA_signal
<222> 573..578

<221> polyA_site
<222> 607..660

<400> 80 

gaatttgtaa aacttctgct cgtttacact gcacattgaa tacaggtaac taattggaag 60 
ppagaggggag atcactcttt tg atg gtg gcc ctg aac ctc att ctg gtt ccc 112
Met Val Ala Leu Asn Leu Ile Leu Val Pro
-10 -5

tgc tgc gct gct tgg tgt gac cca cgg agg atc cac tcc cag gat gac 160
Cys Cys Ala Ala Trp Cys Asp Pro Arg Arg Ile His Ser Gln Asp Asp
1 5 10

ccc cgt agc tct gct gct gat act ggg tct gcg atg cag cgg cgt 208
Uy^^ Asp Thr Gly Ser Ala Met Gln Arg Arg
15 20 25

gag gcc tgg gct ggt tgg aga agg tca caa ccc ttc tct gtt ggt ctg 256
Glu Ala Trp Ala Gly Trp Arg Arg Ser Gln Pro Phe Ser Val Gly Leu
40

cct tct gct gaa aga ctc gag aac caa cca ggg aag ctg tcc tgg agg 304
Pro Ser Ala Glu Arg Leu Glu Asn Gln Pro Gly Lys Leu Ser Trp Arg
45 50 55 60

tcc ctg gtc gga gag gga tat aga atc tgt gac ctc tgacaactgt 350
Ser Leu Val Gly Glu Gly Tyr Arg Ile Cys Asp Leu
65 70
gaagccaccc tgggctacag aaaccacagt cttcccagca attattacaa ttcttgaatt 410
ccttggggat tttttactgc cctttcaaag cacttaagtg ttagatctaa cgtgttccag 470
tgtctgtctg aggtgactta aaaaatcaga acaaaacttc tattatccag agtcatggga 530
gagtacaccc tttccaggaa taatgttttg ggaaacactg aaatgaaatc ttcccagtat 590
tataaattgt gtatttaaaa aaagaaactt ttctgaatgc ctacctggcg gtgtatacca 650
ggcagtgtgc 660



<210> 81
<211> 605
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 47..541

<221> sig_peptide
<222> 47..220
<223> Von Heijne matrix
score 5.4
seq QLLDSVLWLGALG/LT

<221> polyA_site
<222> 597..605

<400> 81 

aaagtgggag gagcactagg tcttcccgtc acctccacct ctctcc atg acc cgg 55
Met Thr Arg

ctc tgc [illegible] ccc aga ccc gaa gca cgt gag gat ccg atc cca gtt cct 103
Leu Cys Leu Pro Arg Pro Glu Ala Arg Glu Asp Pro Ile Pro Val Pro
-55 -50 -45 -40

cca agg ggc ctg ggt gct ggg gag ggg tca ggt agt cca gtg cgt cca 151
Pro Arg Gly Leu Gly Ala Gly Glu Gly Ser Gly Ser Pro Val Arg Pro
-35 -30 -25

cct gta tcc acc tgg ggc cct agc tgg gcc cag ctc ctg gac agt gtc 199
Pro Val Ser Thr Trp Gly Pro Ser Trp Ala Gln Leu Leu Asp Ser Val
-20 -15 -10

cta tgg ctg ggg gca cta gga ctg aca atc cag gca gtc ttt tcc acc 247
Leu Trp Leu Gly Ala Leu Gly Leu Thr Ile Gln Ala
1
tgg gcc tgagagcccc tccccacaac tcagtgtcct tcaaatatac aatgaccacc 591
Trp Ala

cttcttcaaa aaaa 605



<210> 82
<211> 396
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 46..285

<221> sig_peptide
<222> 46..150
<223> Von Heijne matrix
score 3.6
seq LEPGLSSSAACNG/KE

<221> polyA_signal
<222> 364..369

<221> polyA_site
<222> 385..396

<400> 82 

cctctacagg aatcagactc agcctctttt ggttttcagt gaagt atg cct ttt caa 57
Met Pro Phe Gln
-35

ttt gga acc cag cca agg agg ttt cca gtg gaa gga gga gat tct tca 105
Phe Gly Thr Gln Pro Arg Arg Phe Pro Val Glu Gly Gly Asp Ser Ser
-30 -25 -20

att gag ctg gaa cct ggg ctg agc tcc agt gct gcc tgt aat ggg aag 153
Ile Glu Leu Glu Pro Gly Leu Ser Ser Ser Ala Ala Cys Asn Gly Lys
-15 -10 -5 1

gag atg tca cca acc agg caa ctc cgg agg tgc cct gga agt cat tgc 201
Glu Met Ser Pro Thr Arg Gln Leu Arg Arg Cys Pro Gly Ser His Cys
5 10 15

ctg aca ata act gat gtt ccc gtc act gtt tat gca aca acg aga aag 249
Leu Thr Ile Thr Asp Val Pro Val Thr Val Tyr Ala Thr Thr Arg Lys
20 25 30

cca cct gca caa agc agc aag gaa atg cat cct aaa tagcaccatt 295
Pro Pro Ala Gln Ser Ser Lys Glu Met His Pro Lys
35 40 45

aagtcttttg tcaaggtctg actaggtcaa gggtaatgga ccagtatcat ctggtgatct 355
Jj'^gtaaacaaa taaaagtggt ggcaccttca aaaaaaaaaa a 396



<210> 83
<211> 432
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 22..240

<221> sig_peptide
<222> 22..84
<223> Von Heijne matrix
score 12
seq VLVLCVLLLQAQG/GY

<221> polyA_signal
<222> 397..402

<221> polyA_site
<222> 421..432

<400> 83

gctcacgctc tggtcagagt t atg gca ccc cag act ctg ctg cct gtc ctg 51
Met Ala Pro Gln Thr Leu Leu Pro Val Leu
-20 -15

gtt ctc tgt gtg ctg ctg ctg cag gcc cag gga gga tac cgt gac aag 99
Val Leu Cys Val Leu Leu Leu Gln Ala Gln Gly Gly Tyr Arg Asp Lys
-10 -5 1 5

atg agg atg cag aga atc aag gtc tgt gag aag cga ccc agc ata gat 147
Met Arg Met Gln Arg Ile Lys Val Cys Glu Lys Arg Pro Ser Ile Asp
10 15 20

cta tgc atc cac cac tgt tca tgt ttc caa aag tgt gaa aca aat aag 195
Leu Cys Ile His His Cys Ser Cys Phe Gln Lys Cys Glu Thr Asn Lys
25 30 35

ata tgc tgt tca gcc ttc tgt ggg aac att tgt atg agc atc cta 240
Ile Cys Cys Ser Ala Phe Cys Gly Asn Ile Cys Met Ser Ile Leu
40 45 50

tgagtgggag agtgggctgg gatgtgcatc ctgctccctg aacccttcca tccgagactg 300
tgcccacatc cgaagcacaa ggacatcaaa tcatcagcac aagaacatca acaggaatgc 360
caccctcccc agtgtctgaa ctccctgtcc ctgtcaaatg aaccagaaca aatgcccatg 420
aaaaaaaaaa aa 432



<210> 84
<211> 420
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 89..382

<221> polyA_site
<222> 408..420

<400> 84

jpcttgcctga cccccatgtc gcctctgtag gtagaagaag tatgtcttcc tggaccccct 60
vpgctggtgct gtaacaaaga cccatgtg atg ctg ggg gca gag aca gag gag 112
Met Leu Gly Ala Glu Thr Glu Glu
1 5

aag ctg ttt gat gcc ccc ttg tcc atc agc aag aga gag cag ctg gaa 160
Lys Leu Phe Asp Ala Pro Leu Ser Ile Ser Lys Arg Glu Gln Leu Glu
10 15 20

cag cag gtc cca gag aac tac ttc tat gtg cca gac ctg ggc cag gtg 208
Gln Gln Val Pro Glu Asn Tyr Phe Tyr Val Pro Asp Leu Gly Gln Val
25 30 35 40

cct gag att gat gtt cca tcc tac ctg cct gac ctg ccc ggc att gcc 256
Pro Glu Ile Asp Val Pro Ser Tyr Leu Pro Asp Leu Pro Gly Ile Ala
45 50 55

aac gac ctc atg tac att gcc gac ctg ggc ccc ggc att gcc ccc tct 304
Asn Asp Leu Met Tyr Ile Ala Asp Leu Gly Pro Gly Ile Ala Pro Ser
60 65 70

gcc cct ggc acc att cca gaa ctg ccc acc ttc cac act gag gta gcc 352
Ala Pro Gly Thr Ile Pro Glu Leu Pro Thr Phe His Thr Glu Val Ala
75 80 85

gag cct ctc aag acc tac aag atg ggg tac taacagcacc accaccgccc 402
Glu Pro Leu Lys Thr Tyr Lys Met Gly Tyr
90 95
ccaccaaaaa aaaaaaaa 420



<210> 85
<211> 501
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..415

<221> sig_peptide
<222> 80..142
<223> Von Heijne matrix
score 5.4
seq TFCLIFGLGAVWG/LG

<221> polyA_signal
<222> 471..476

<221> polyA_site
<222> 488..501

<400> 85

|;!<:ccgcttgat tccaagaacc tcttcgatat ttatttttat ttttaaagag ggagacgatg 60
gactgagctg atccgcacc atg gag tct cgg gtc tta ctg aga aca ttc tgt 112
Met Glu Ser Arg Val Leu Leu Arg Thr Phe Cys
-20 -15

ttg atc ttc ggt ctc gga gca gtt tgg ggg ctt ggt gtg gac cct tcc 160
Leu Ile Phe Gly Leu Gly Ala Val Trp Gly Leu Gly Val Asp Pro Ser
-10 -5 1 5

cta cag att gac gtc tta aca gag tta gaa ctt ggg gag tcc acg acc 208
Leu Gln Ile Asp Val Leu Thr Glu Leu Glu Leu Gly Glu Ser Thr Thr
10 15 20

gga gtg cgt cag gtc ccg ggg ctg cat aat ggg acg aaa gcc ttt ctc 256
Gly Val Arg Gln Val Pro Gly Leu His Asn Gly Thr Lys Ala Phe Leu
25 30 35

ttt caa gat act ccc aga agc ata aaa gca tcc act gct aca gct gaa 304
Phe Gln Asp Thr Pro Arg Ser Ile Lys Ala Ser Thr Ala Thr Ala Glu
40 45 50

cag ttt ttt cag aag ctg aga aat aaa cat gaa ttt act att ttg gtg 352
Gln Phe Phe Gln Lys Leu Arg Asn Lys His Glu Phe Thr Ile Leu Val
55 60 65 70

acc cta aaa cag acc cac tta aat tca gga gtt att ctc tca att cac 400
Thr Leu Lys Gln Thr His Leu Asn Ser Gly Val Ile Leu Ser Ile His
75 80 85

cac ttg gat cac agg taaatgtggt tgctggagtt tcctgtgttt tcattatatg 455
His Leu Asp His Arg
90

tggttaaatg aatatattaa agagaagtaa acaaaaaaaa aaaaaa 501



<210> 86
<211> 454
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 152..361

<221> sig_peptide
<222> 152..283
<223> Von Heijne matrix
score 4.7
seq FLLSLSLITYCFW/DP



<400> 86 

gacattttac ttttttctgt taacgcttac cctagaaatt agaaatgaca ccacgtattc 60 
ttagcgaagt ccagttttca gcattttgtc cttattggac aatagcaagg atattagaac 120 
gtgttggttc cgcgtgcttc cgtcttgagt t atg tgc tgc tat tgt cgg ata 172
Met Cys Cys Tyr Cys Arg Ile
-40

ttt tgt ctt aga tgt acg tac ttt cct gtt cat tgt ggt atg tgt aat 220
Phe Cys Leu Arg Cys Thr Tyr Phe Pro Val His Cys Gly Met Cys Asn
-30 -25

ttg cgt tac ttt gaa ttt tcc acg ttt tta ctt tct ttg tct ctc atc 268
Leu Arg Tyr Phe Glu Phe Ser Thr Phe Leu Leu Ser Leu Ser Leu Ile
-20 -15 -10

act tac tgc ttt tgg gac ccc ccc cat cgg ggt tca cat tcc ctc tcc 316
Thr Tyr Cys Phe Trp Asp Pro Pro His Arg Gly Ser His Ser Leu Ser
-5 1 5 10

cta gag cac act ccc ttg gat ttc ctc gag tgg ggt ctg ctg cgg 361
Leu Glu His Thr Pro Leu Asp Phe Leu Glu Trp Gly Leu Leu Arg
15 20 25

tgaagctttc ccattttatg tgcagattat tttcagaggg tatatagaat tcaggcagct 421
gtttcgttgt agcacattaa aaatattttc ccc 454



<210> 87
<211> 1272
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 32..307

<221> sig_peptide
<222> 32..70
<223> Von Heijne matrix
score 4.2
seq MLFSLSLLSNLNQ/IG

<221> polyA_signal
<222> 1240..1245

<221> polyA_site
<222> 1261..1272

<400> 87

gtcaggttgc accgcccttt ggttcccgag c atg ctg ttt tct ctc agc ctt 52
Met Leu Phe Ser Leu Ser Leu
-10

ctc tcc aac ctt aac caa atc ggc agc agc cac ctc gac cgc cca cac 100
Leu Ser Asn Leu Asn Gln Ile Gly Ser Ser His Leu Asp Arg Pro His
-5 1 5 10

att cct ggc caa tca gct cag ctg ttt att tac caa atg tct tca caa 148
Ile Pro Gly Gln Ser Ala Gln Leu Phe Ile Tyr Gln Met Ser Ser Gln
15 20 25

caa cta cag cag cag cct tcg gct aac aaa aaa gca gga aaa atc cac 196
Gln Leu Gln Gln Gln Pro Ser Ala Asn Lys Lys Ala Gly Lys Ile His
30 35 40

aac acc ccc ttc gcc aac caa cta aat cca acg caa cat ctg gca aaa 244
Asn Thr Pro Phe Ala Asn Gln Leu Asn Pro Thr Gln His Leu Ala Lys
45 50 55

cct ttt cag caa att ctt cct ggc cgt cag tcc ggc agc ctc acc tca 292
Pro Phe Gln Gln Ile Leu Pro Gly Arg Gln Ser Gly Ser Leu Thr Ser
60 65 70

cca ttt cta gct tgc tgaaacccaa aactaatctc caagaaggag aagcttctct 347
Pro Phe Leu Ala Cys

cgcagccgga gcaggtccct ttctagagat aggagaagag agagatcgct gtctcgggag 407
agaaatcaca agccgtcccg atccttctct aggtctcgta gtcgatttag gtcaaatgaa 467
pi^ggaaataga agacagtttg caagagaagt ggtgtacagg aaattacttc atttgacagg 527
agtatgtaca gaaaattcaa gttttgtttg agacttcata agcttggtgc atttttaaga 587
tgttttagct gttcaaatct gtttgtctct tgaaacagtg acacaaaagt gtaattctct 647
atggtttgaa atggatcata cgaggcatgt aataccaaga attgttactt tacaatgttc 707
ccttaagcaa aattgaattt gctttgaact tttagttatg cacagactga taataaacct 767
ctaaacctgc ccagcggaag tgtgtttttt tttaaattta aatacagaaa caactggcaa 827
aaattgaact aagatttact tttttttcca tagctgggat ataggctgca gctatagttg 887
aacaagcagt ctttaaaaac tgctgtgaaa cacaggccat cagggaaaac gaaatgctgc 947
^^.aactattaaat tagaggtttt tgaaaaatcc aactctcatc ctgggcagag gttgcctagt 1007
ii'^J^ggtatagaa tgttaagttt caagaaagtt tacctttgct ttaggtcgta agttccttat 1067
ttgattgccg tatatggata catggctgtt cgtgacattc tttatgtgca aatttgtgat 1127
^|;|:tcaaaaatg tcctgccagt ttaagggtac attgtagagc cgaactttga gttactgtgc 1187
gpagatttttt ttcatgctgt catttgtaat atgttttgtg agaatccttg ggattaaagt 1247

tttggttaca gattaaaaaa aaaaa 1272



<210> 88
<211> 804
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 114..734

<221> sig_peptide
<222> 114..239
<223> Von Heijne matrix
score 5.2
seq LLFDLVCHEFCQS/DD

<221> polyA_signal
<222> 768..773

<221> polyA_site
<222> 793..804

<400> 88 

ccaacaccag gaagagtctg aagagcagcc agtgtttcgg cttgtgccct gtatacttga 60 
agctgccaaa caagtacggt agttctgaaa atccagaatg gcttgatgtt tac atg 116 

Met 

cac att tta caa ctg ctt act aca gtg gat gat gga att caa gca att 164
His Ile Leu Gln Leu Leu Thr Thr Val Asp Asp Gly Ile Gln Ala Ile
-40 -35 -30

gta cat tgt cct gac act gga aaa gac att tgg aat tta ctt ttt gac 212
Val His Cys Pro Asp Thr Gly Lys Asp Ile Trp Asn Leu Leu Phe Asp
-25 -20 -15 -10

ctg gtc tgc cat gaa ttc tgc cag tct gat gat cca ccc atc att ctt 260
Leu Val Cys His Glu Phe Cys Gln Ser Asp Asp Pro Pro Ile Ile Leu
-5 1 5

caa gaa cag aaa aca gtg cta gcc tct gtt ttt tca gtg ttg tct gcc 308
Gln Glu Gln Lys Thr Val Leu Ala Ser Val Phe Ser Val Leu Ser Ala
10 15 20

atc tat gcc tca cag act gag caa gag tat cta aag ata gaa aaa gta 356
Ile Tyr Ala Ser Gln Thr Glu Gln Glu Tyr Leu Lys Ile Glu Lys Val
25 30 35

gat ctt cct cta att gac agc ctc att cgg gtc tta caa aat atg gaa 404
Asp Leu Pro Leu Ile Asp Ser Leu Ile Arg Val Leu Gln Asn Met Glu
45 50 55

cag tgt cag aaa aaa cca gag aac tcg gca gag tct aac aca gag gaa 452
Gln Cys Gln Lys Lys Pro Glu Asn Ser Ala Glu Ser Asn Thr Glu Glu
60 65 70

act aaa agg act gat tta acc caa gat gat ctc cac ttg aaa atc tta 500
Thr Lys Arg Thr Asp Leu Thr Gln Asp Asp Leu His Leu Lys Ile Leu
75 80 85

aag gat att tta tgt gaa ttt ctt tct aat att ttt cag gca tta aca 548
Lys Asp Ile Leu Cys Glu Phe Leu Ser Asn Ile Phe Gln Ala Leu Thr
90 95 100

aag gag acg gtg gct cag gga gta aag gaa ggc cag ttg agc aaa cag 596
Lys Glu Thr Val Ala Gln Gly Val Lys Glu Gly Gln Leu Ser Lys Gln
105 110 115

aag tgt tcc tct gca ttt caa aac ctt ctt cct ttc tat agc cct gtg 644
Lys Cys Ser Ser Ala Phe Gln Asn Leu Leu Pro Phe Tyr Ser Pro Val
120 125 130 135

gtg gaa gat ttt att aaa atc cta cgt gaa gtt gat aag gcg ctt gct 692
Val Glu Asp Phe Ile Lys Ile Leu Arg Glu Val Asp Lys Ala Leu Ala
140 145 150

gat gac ttg gaa aaa aac ttc cca agt ttg aag gtt cag act 734
Asp Asp Leu Glu Lys Asn Phe Pro Ser Leu Lys Val Gln Thr
155 160 165

taaaacctga attggaatta cttctgtaca agaaataaac tttatttttc tcactgacaa 794
aaaaaaaaaa 804



<210> 89
<211> 802
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 199..801

<221> polyA_signal
<222> 780..785

<221> polyA_site
<222> 791..802

<400> 89 

agtcaccgcc tgcttcgcac tgagcctccc gactcagact ctgagtccag ctccgaagag 60 
gaagaggaat tcggtgtggt tggaaatcgc tctcgctttg ccaagggaga ctatttacga 120 
tgctgcaaga tctgttatcc gctctgtggt tttgtcatcc ttgctgcctg tgttgtggcc 180 
tgtgttggct tggtgtgg atg cag gtt gct ctc aag gag gat ctg gat gcc 231
Met Gln Val Ala Leu Lys Glu Asp Leu Asp Ala
1 5 10

ctc aag gaa aaa ttt cga aca atg gaa tct aat cag aaa agc tca ttc 279
Leu Lys Glu Lys Phe Arg Thr Met Glu Ser Asn Gln Lys Ser Ser Phe
15 20 25

caa gaa atc ccc aaa ctt aat gaa gaa cta ctc agc aag caa aaa caa 327
Gln Glu Ile Pro Lys Leu Asn Glu Glu Leu Leu Ser Lys Gln Lys Gln
30 35 40

ctt gag aag att gaa tct gga gag atg ggt ttg aac aaa gtc tgg ata 375
Leu Glu Lys Ile Glu Ser Gly Glu Met Gly Leu Asn Lys Val Trp Ile
45 50 55

[illegible] atg aat aag cag att tct ctg ttg act tct gca gtg 423
Asn Ile Thr Glu Met Asn Lys Gln Ile Ser Leu Leu Thr Ser Ala Val
60 65 70 75

aac cac ctc aaa gcc aat gtt aag tca gct gca gac ttg att agc ctg 471
Asn His Leu Lys Ala Asn Val Lys Ser Ala Ala Asp Leu Ile Ser Leu
80 85 90

cct acc act gta gag gga ctt cag aag agt gta gct tcc att ggc aat 519
Pro Thr Thr Val Glu Gly Leu Gln Lys Ser Val Ala Ser Ile Gly Asn
95 100 105

act tta aac agc gtc cat ctt gct gtg gaa gca cta cag aaa act gtg 567
Thr Leu Asn Ser Val His Leu Ala Val Glu Ala Leu Gln Lys Thr Val
110 115 120

gat gaa cac aag aaa acg atg gaa tta ctg cag agt gat atg aat cag 615
Asp Glu His Lys Lys Thr Met Glu Leu Leu Gln Ser Asp Met Asn Gln
125 130 135

cac ttc ttg aag gag act cct gga agc aac cag atc att ccg tca cct 663
His Phe Leu Lys Glu Thr Pro Gly Ser Asn Gln Ile Ile Pro Ser Pro
140 145 150 155

tca gcc aca tca gaa ctt gac aat aaa acc cac agt gag aat ttg aaa 711
Ser Ala Thr Ser Glu Leu Asp Asn Lys Thr His Ser Glu Asn Leu Lys
160 165 170

cag atg ggt gat aga tct gcc act ctg aaa aga cag tct ttg gac caa 759
Gln Met Gly Asp Arg Ser Ala Thr Leu Lys Arg Gln Ser Leu Asp Gln
175 180 185

gtc acc aac aga aca gat aca gta aaa atc caa aaa aaa aaa a 802
Val Thr Asn Arg Thr Asp Thr Val Lys Ile Gln Lys Lys Lys
190 195 200






<210> 90
<211> 1490
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 38..1174

<221> sig_peptide
<222> 38..148
<223> Von Heijne matrix
score 7.3
seq LLSACLVTLWGLG/EP

<221> polyA_signal
<222> 1452..1457

<221> polyA_site
<222> 1478..1490

<400> 90

jJicatcatcca gagcagccag tgtccgggag gcagaag atg ccc cac tcc agc ctg 55
Met Pro His Ser Ser Leu
-35

cat cca tcc atc ccg tgt ccc agg ggt cac ggg gcc cag aag gca gcc 103
His Pro Ser Ile Pro Cys Pro Arg Gly His Gly Ala Gln Lys Ala Ala
-30 -25 -20

ipftg gtt ctg ctg agt gcc tgc ctg gtg acc ctt tgg ggg cta gga gag 151
Leu Val Leu Leu Ser Ala Cys Leu Val Thr Leu Trp Gly Leu Gly Glu
-10 -5 1

cca cca gag cac act ctc cgg tac ctg gtc ctc cac cta gcc tcc ctg 199
Pro Pro Glu His Thr Leu Arg Tyr Leu Val Leu His Leu Ala Ser Leu
5 10 15

cag ctg gga ctg ctg tta aac ggg gtc tgc agc ctg gct gag gag ctg 247
Gln Leu Gly Leu Leu Leu Asn Gly Val Cys Ser Leu Ala Glu Glu Leu
20 25 30

cgc cac atc cac tcc agg tac cgg ggc agc tac tgg agg act gtg cgg 295
Arg His Ile His Ser Arg Tyr Arg Gly Ser Tyr Trp Arg Thr Val Arg
35 40 45

gcc tgc ctg ggc tgc ccc ctc cgc cgt ggg gcc ctg ttg ctg ctg tcc 343
Ala Cys Leu Gly Cys Pro Leu Arg Arg Gly Ala Leu Leu Leu Leu Ser
50 55 60 65

atc tat ttc tac tac tcc ctc cca aat gcg gtc ggc ccg ccc ttc act 391
Ile Tyr Phe Tyr Tyr Ser Leu Pro Asn Ala Val Gly Pro Pro Phe Thr
70 75 80

tgg atg ctt gcc ctc ctg ggc ctc tcg cag gca ctg aac atc ctc ctg 439
Trp Met Leu Ala Leu Leu Gly Leu Ser Gln Ala Leu Asn Ile Leu Leu
85 90 95

ggc ctc aag ggc ctg gcc cca gct gag atc tct gca gtg tgt gaa aaa 487
Gly Leu Lys Gly Leu Ala Pro Ala Glu Ile Ser Ala Val Cys Glu Lys
100 105 110

ggg aat ttc aac gtg gcc cat ggg ctg gca tgg tca tat tac atc gga 535
Gly Asn Phe Asn Val Ala His Gly Leu Ala Trp Ser Tyr Tyr Ile Gly
115 120 125

tat ctg cgg ctg atc ctg cca gag ctc cag gcc cgg att cga act tac 583
Tyr Leu Arg Leu Ile Leu Pro Glu Leu Gln Ala Arg Ile Arg Thr Tyr
130 135 140 145

aat cag cat tac aac aac ctg cta cgg ggt gca gtg agc cag cgg ctg 631
Asn Gln His Tyr Asn Asn Leu Leu Arg Gly Ala Val Ser Gln Arg Leu
150 155 160

tat att ctc ctc cca ttg gac tgt ggg gtg cct gat aac ctg agt atg 679
Tyr Ile Leu Leu Pro Leu Asp Cys Gly Val Pro Asp Asn Leu Ser Met
165 170 175

gct gac ccc aac att cgc ttc ctg gat aaa ctg ccc cag cag acc ggt 727
Ala Asp Pro Asn Ile Arg Phe Leu Asp Lys Leu Pro Gln Gln Thr Gly
180 185 190

gac cgt gct ggc atc aag gat cgg gtt tac agc aac agc atc tat gag 775
Asp Arg Ala Gly Ile Lys Asp Arg Val Tyr Ser Asn Ser Ile Tyr Glu
195 200 205

ctt ctg gag aac ggg cag cgg gcg ggc acc tgt gtc ctg gag tac gcc 823
Leu Leu Glu Asn Gly Gln Arg Ala Gly Thr Cys Val Leu Glu Tyr Ala
210 215 220 225

acc ccc ttg cag act ttg ttt gcc atg tca caa tac agt caa gct ggc 871
Thr Pro Leu Gln Thr Leu Phe Ala Met Ser Gln Tyr Ser Gln Ala Gly
230 235 240

ttt agc cgg gag gat agg ctt gag cag gcc aaa ctc ttc tgc cgg aca 919
Phe Ser Arg Glu Asp Arg Leu Glu Gln Ala Lys Leu Phe Cys Arg Thr
245 250 255

ctt gag gac atc ctg gca gat gcc cct gag tct cag aac aac tgc cgc 967
Leu Glu Asp Ile Leu Ala Asp Ala Pro Glu Ser Gln Asn Asn Cys Arg
260 265 270

ctc att gcc tac cag gaa cct gca gat gac agc agc ttc tcg ctg tcc 1015
Leu Ile Ala Tyr Gln Glu Pro Ala Asp Asp Ser Ser Phe Ser Leu Ser
275 280 285

cag gag gtt ctc cgg cac ctg cgg cag gag gaa aag gaa gag gtt acc 1063
Gln Glu Val Leu Arg His Leu Arg Gln Glu Glu Lys Glu Glu Val Thr
295 300 305

gtg ggc agc ttg aag acc tca gcg gtg ccc agt acc tcc acg atg tcc 1111
Val Gly Ser Leu Lys Thr Ser Ala Val Pro Ser Thr Ser Thr Met Ser
310 315 320

caa gag cct gag ctc ctc ctc agt gga atg gga aag ccc ctc cct ctc 1159
Gln Glu Pro Glu Leu Leu Leu Ser Gly Met Gly Lys Pro Leu Pro Leu
325 330 335

cgc acg gat ttc tct tgagacccag ggtcaccagg ccagagcctc cagtggtctc 1214
Arg Thr Asp Phe Ser
340

caagcctctg gactgggggc tctcttcagt ggctgaatgt ccagcagagc tatttccttc 1274
cacagggggc cttgcaggga agggtccagg acttgacatc ttaagatgcg tcttgtcccc 1334
ttgggccagt catttcccct ctctgagcct cggtgtcttc aacctgtgaa atgggatcat 1394
aatcactgcc ttacctccct cacggttgtt gtgaggactg agtgtgtgga agtttttcat 1454
aaactttgga tgctagtgta cttaaaaaaa aaaaaa 1490



<210> 91
<211> 361
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 26..361

<221> polyA_site
<222> 350..361



<400> 91 

tcgagaagct gccccttagc caacc atg ccg tct gag ggt cgc tgc tgg gag 52 

Met Pro Ser Glu Gly Arg Cys Trp Glu 
1 5 

acc ttg aag gcc cta cgc agt tcc gac aaa ggt cgc ctt tgc tac tac 100
Thr Leu Lys Ala Leu Arg Ser Ser Asp Lys Gly Arg Leu Cys Tyr Tyr
10 15 20 25

cgc gac tgg ctg ctg cgg cgc gag gat gtt tta gaa gaa tgt atg tct 148
Arg Asp Trp Leu Leu Arg Arg Glu Asp Val Leu Glu Glu Cys Met Ser
30 35 40

ctt ccc aag cta tct tct tat tct gga tgg gtg gta gag cac gtc cta 196
Leu Pro Lys Leu Ser Ser Tyr Ser Gly Trp Val Val Glu His Val Leu
45 50 55

ccc cat atg cag gag aac caa cct ctg tct gag act tcg cca tcc tct 244
Pro His Met Gln Glu Asn Gln Pro Leu Ser Glu Thr Ser Pro Ser Ser
60 65 70

acg tca gct tca gcc cta gat caa ccc tca ttt gtt ccc aaa tct cct 292
Thr Ser Ala Ser Ala Leu Asp Gln Pro Ser Phe Val Pro Lys Ser Pro
75 80 85

gac gca agc tct gcc ttt tcc cca gcc tcc cct gca aca cca aat gga 340
Asp Ala Ser Ser Ala Phe Ser Pro Ala Ser Pro Ala Thr Pro Asn Gly
90 95 100 105

acc aag ggc aaa aaa aaa aaa 361
Thr Lys Gly Lys Lys Lys Lys
110



<210> 92
<211> 605
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 3..131

<221> polyA_site
<222> 591..605



<400> 92 

ca tcc ctt ccc cag gct tta tgg ttc cag ttc ttc tac cac tct gga 47
Ser Leu Pro Gln Ala Leu Trp Phe Gln Phe Phe Tyr His Ser Gly
1 5 10 15

agc tcc cta gaa tct cct gga atg ctt aat gga cct ttc cag cac cga 95
Ser Ser Leu Glu Ser Pro Gly Met Leu Asn Gly Pro Phe Gln His Arg
20 25 30

aat tca aga att atg act cat cgg tca gca gaa aag tgaggatacc 141
Asn Ser Arg Ile Met Thr His Arg Ser Ala Glu Lys
35 40
ttttcctaac ctacctgctt cccctgcagt ttcctcacaa tcttactctt tatattttag 201
catatgtagc ttctcaggat gttaattctg ttctctctgt gttggtgtct gagcacccag 261
aaggtagagc caggggcact tataaaccag gagcattatt tgacaggcac ttaagaaaga 321
cactggctac gtaatcccag cactttggga ggctgaggcg gatggatcac atgaggtcag 381
gagttcgaga ccagcctggc cagcatggtg aaaccctgtc tctactaaaa atacaaaaat 441
tagctgggtg tggttgcaca cgcctgtaat cccagctacc tgggaggctg aggcaggaga 501
atcgcttgaa cttgggaggc ggaggttgca gtgagcctag attttgccat tgcactccag 561
cctgggtgac aagggcgaaa ctccatccca aaaaaaaaaa aaaa 605



<210> 93
<211> 591
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 33..185

<221> sig_peptide
<222> 33..80
<223> Von Heijne matrix
score 3.7
seq IALTLIPSMLSRA/AG

<221> polyA_signal
<222> 570..575

<221> polyA_site
<222> 586..591

<400> 93

"baatcttctc agcttataac cgtctttccc tt atg cta agg ata gcc ctt aca 53
Met Leu Arg Ile Ala Leu Thr
-15 -10

ctc atc cca tct atg ctg tca agg gct gct ggt tgg tgc tgg tac aag 101
Leu Ile Pro Ser Met Leu Ser Arg Ala Ala Gly Trp Cys Trp Tyr Lys
-5 1 5

gag ccc act cag cag ttt tct tac ctt tgc ctg ccc tgc ctt tca tgg 149
Glu Pro Thr Gln Gln Phe Ser Tyr Leu Cys Leu Pro Cys Leu Ser Trp
10 15 20

aat aag aaa ggc aac gtt ttg cag ctt cca aat ttc tgaagaaact 195
Asn Lys Lys Gly Asn Val Leu Gln Leu Pro Asn Phe
25 30 35

aatctcagat tggcagttaa agtcaaaatg ttgccaaata tttattcctt ttgcctaagt 255
ttggctaccc ggttcaattg ctttttattt ttaatgtctt gactcttcag agttcgtacc 315
tcaaaagaac aatgagaaca tttgctttgc tttctgctga atccctaatc tcaacaatct 375
atacctggac tgtccagttc tcctcctgtg ctatcttctc ttctatccaa gtagaatgta 435
tgccaggagc tccttccctc tagcaatttc tactaaaatg tccaagtaga atgtttcctt 495
ttacaatcaa attactgtat ttattaattt gctagaatcc agtaaatcat tttggtagct 555
ctggctgtgc tatcaataaa aagatgaaag caaaaa 591



<210> 94
<211> 1150
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 184..915

<221> sig_peptide
<222> 184..237
<223> Von Heijne matrix
score 3.5
seq LLGLELSEAEAIG/AD

<221> polyA_signal
<222> 1119..1124

<221> polyA_site
<222> 1139..1150



<400> 94 

cggatttgac gatggtgttc ggtcttgaat ggaaatgtag tcttaggcca gtcttaggtt 60 
tttgaacagg atagtaggta tccggagtcg attgagggcc agagcaggca ctggggttcg 120
gatcctgggc aaagtttccc acgttgaggg tctcgaggac gcctagatct ctttcccagg 180
gcc atg gcg aac ccg aag ctg ctg gga ctg gag cta agc gag gcg gag 228
Met Ala Asn Pro Lys Leu Leu Gly Leu Glu Leu Ser Glu Ala Glu
-15 -10 -5

gcg atc ggt gct gat tcg gcg cga ttt gag gag ctg ctg ctg cag gcc 276
Ala Ile Gly Ala Asp Ser Ala Arg Phe Glu Glu Leu Leu Leu Gln Ala
1 5 10

tcg aag gag ctc cag caa gcc cag aca acc aga cca gaa tcg aca caa 324
Ser Lys Glu Leu Gln Gln Ala Gln Thr Thr Arg Pro Glu Ser Thr Gln
20 25

atc cag cct cag cct ggt ttc tgc ata aag acc aac tcc tcg gaa ggg 372
Ile Gln Pro Gln Pro Gly Phe Cys Ile Lys Thr Asn Ser Ser Glu Gly
35 40 45

aag gtt ttc atc aac atc tgc cac tcc ccc tct atc cct cct ccc gcc 420
Lys Val Phe Ile Asn Ile Cys His Ser Pro Ser Ile Pro Pro Pro Ala
50 55 60

gac gtg acc gag gag gag ctg ctt cag atg cta gag gag gac caa gct 468
Asp Val Thr Glu Glu Glu Leu Leu Gln Met Leu Glu Glu Asp Gln Ala
65 70 75

"tgg ttt cgc atc ccc atg agt ctg gga gag cct cat gca gaa ctg gat 516
Gly Phe Arg Ile Pro Met Ser Leu Gly Glu Pro His Ala Glu Leu Asp
80 85 90

gca aaa ggc cag gga tgt acc gcc tac gac gta gct gtc aac agc gac 564
Ala Lys Gly Gln Gly Cys Thr Ala Tyr Asp Val Ala Val Asn Ser Asp
95 100 105

ttc tac cgg agg atg cag aac agc gat ttc ttg cgg gag ctc gtg atc 612
Phe Tyr Arg Arg Met Gln Asn Ser Asp Phe Leu Arg Glu Leu Val Ile
115 120 125

acc atc gcc agg gag ggc ctt gag gac ata tac aac ttg cag ctg aat 660
Thr Ile Ala Arg Glu Gly Leu Glu Asp Ile Tyr Asn Leu Gln Leu Asn
130 135 140

ccg gaa tgg cgc atg atg aag aac cgg cca ttc atg ggc tcc atc tcg 708
Pro Glu Trp Arg Met Met Lys Asn Arg Pro Phe Met Gly Ser Ile Ser
145 150 155

cag cag aac atc cgc tcg gag cag cgt cct cgg atc cag gag ctg ggg 756
Gln Gln Asn Ile Arg Ser Glu Gln Arg Pro Arg Ile Gln Glu Leu Gly
160 165 170

gac ctg tac acg ccc gcc ccc ggg aga gct gag tca ggg cct gaa aag 804
Asp Leu Tyr Thr Pro Ala Pro Gly Arg Ala Glu Ser Gly Pro Glu Lys
175 180 185

cct cac ctg aac ctg tgg ctg gaa gcc ccc gac ctc ctc ttg gcc gaa 852
Pro His Leu Asn Leu Trp Leu Glu Ala Pro Asp Leu Leu Leu Ala Glu
190 195 200 205

gtt gac ctc ccc aaa ctg gat gga gcc ctg ggg ctg tcg ctg gag atc 900
Val Asp Leu Pro Lys Leu Asp Gly Ala Leu Gly Leu Ser Leu Glu Ile
210 215 220

ggg aga acc gcc tgg tgatgggggg cccccagcag ctgtatcatc tagacgctta 955
Gly Arg Thr Ala Trp
225

tatcccgccg cagatcaact ctcatgagag caaggcagcc ttccaccgga agagaaagca 1015
attaatggtg gccatgccgc ttctgccggt gccttcttga tcagggtgtc tccttgtgct 1075
tctgagatgt ggagaagagg ctgctggctt ccctaaaagt tgaaataaaa gatttttgcc 1135
tttaaaaaaa aaaaa 1150



<210> 95
<211> 1513
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 58..1116

<221> sig_peptide
<222> 58..159
<223> Von Heijne matrix
score 4
seq IAVLYLHLYDVFG/DP

<221> polyA_signal
<222> 1486..1491

<221> polyA_site
<222> 1504..1513

<400> 95

ctgactcctg agttctcaca acgcttgacc aataagattc gggagcttct tcagcaa 57 
atg gag aga ggc ctg aaa tca gca gac cct cgg gat ggc acc ggt tac 105
Met Glu Arg Gly Leu Lys Ser Ala Asp Pro Arg Asp Gly Thr Gly Tyr
-30 -25 -20

act ggc tgg gca ggt att gct gtg ctt tac tta cat ctt tat gat gta 153
Thr Gly Trp Ala Gly Ile Ala Val Leu Tyr Leu His Leu Tyr Asp Val
-15 -10 -5

ttt ggg gac cct gcc tac cta cag tta gca cat ggc tat gta aag caa 201
Phe Gly Asp Pro Ala Tyr Leu Gln Leu Ala His Gly Tyr Val Lys Gln
1 5 10

agt ctg aac tgc tta acc aag cgc tcc atc acc ttc ctt tgt ggg gat 249
Ser Leu Asn Cys Leu Thr Lys Arg Ser Ile Thr Phe Leu Cys Gly Asp
15 20 25 30

gca ggc ccc ctg gca gtg gcc gct gtg cta tat cat aag atg aac aat 297
Ala Gly Pro Leu Ala Val Ala Ala Val Leu Tyr His Lys Met Asn Asn
35 40 45

gag aag cag gca gaa gat tgc atc aca cgg cta att cac cta aat aag 345
Glu Lys Gln Ala Glu Asp Cys Ile Thr Arg Leu Ile His Leu Asn Lys
50 55 60

att gat cct cat gct cca aat gaa atg ctc tat ggg cga ata ggc tac 393
Ile Asp Pro His Ala Pro Asn Glu Met Leu Tyr Gly Arg Ile Gly Tyr
65 70 75

atc tat gct ctt ctt ttt gtc aat aag aac ttt gga gtg gaa aag act 441
Ile Tyr Ala Leu Leu Phe Val Asn Lys Asn Phe Gly Val Glu Lys Thr
80 85 90

cct caa agc cat att cag cag att tgt gaa aca att tta acc tct gga 489
Pro Gln Ser His Ile Gln Gln Ile Cys Glu Thr Ile Leu Thr Ser Gly
95 100 105 110

gaa aac cta gct agg aag aga aac ttc acg gca aag tct cca ctg atg 537
Glu Asn Leu Ala Arg Lys Arg Asn Phe Thr Ala Lys Ser Pro Leu Met
115 120 125

tat gaa tgg tac cag gaa tat tat gta ggg gct gct cat ggc ctg gct 585
Tyr Glu Trp Tyr Gln Glu Tyr Tyr Val Gly Ala Ala His Gly Leu Ala
130 135 140

gga att tat tac tac ctg atg cag ccc agc ctt caa gtg agc caa ggg 633
Gly Ile Tyr Tyr Tyr Leu Met Gln Pro Ser Leu Gln Val Ser Gln Gly
145 150 155

aag tta cat agt ttg gtc aag ccc agt gta gac tac gtc tgc cag ctg 681
Lys Leu His Ser Leu Val Lys Pro Ser Val Asp Tyr Val Cys Gln Leu
160 165 170

aaa ttc cct tct ggc aat tac cct cca tgt ata ggt gat aat cga gat 729
Lys Phe Pro Ser Gly Asn Tyr Pro Pro Cys Ile Gly Asp Asn Arg Asp
175 180 185 190

ctg ctt gtc cat tgg tgc cat ggc gcc cct ggg gta atc tac atg ctc 777
Leu Leu Val His Trp Cys His Gly Ala Pro Gly Val Ile Tyr Met Leu
195 200 205

atc cag gcc tat aag gta ttc aga gag gaa aag tat ctc tgt gat gcc 825
Ile Gln Ala Tyr Lys Val Phe Arg Glu Glu Lys Tyr Leu Cys Asp Ala
210 215 220

tat cag tgt gct gat gtg atc tgg caa tat ggg ttg ctg aag aag gga 873
Tyr Gln Cys Ala Asp Val Ile Trp Gln Tyr Gly Leu Leu Lys Lys Gly
225 230 235

tat ggg ctg tgc cac ggt tct gca ggg aat gcc tat gcc ttc ctg aca 921
Tyr Gly Leu Cys His Gly Ser Ala Gly Asn Ala Tyr Ala Phe Leu Thr
240 245 250

ctc tac aac ctc aca cag gac atg aag tac ctg tat agg gcc tgt aag 969
Leu Tyr Asn Leu Thr Gln Asp Met Lys Tyr Leu Tyr Arg Ala Cys Lys
255 260 265 270

ttt gct gaa tgg tgc tta gag tat gga gaa cat gga tgc aga aca cca 1017
Phe Ala Glu Trp Cys Leu Glu Tyr Gly Glu His Gly Cys Arg Thr Pro
275 280 285

gac acc cct ttc tct ctc ttt gaa gga atg gct ggg aca ata tat ttc 1065
Asp Thr Pro Phe Ser Leu Phe Glu Gly Met Ala Gly Thr Ile Tyr Phe
290 295 300

ctg gct gac ctg cta gtc ccc aca aaa gcc agg ttc cct gca ttt gaa 1113
Leu Ala Asp Leu Leu Val Pro Thr Lys Ala Arg Phe Pro Ala Phe Glu
305 310 315

ctc tgaaaggata gcatgccacc tgcaactcac tgcatgaccc tttctgtata 1166
Leu

ttcaaaccca agctaagtgc ttccgttgct ttccaaggaa acaaagagtc aaactgtgga 1226
cttgattttg ttagcttttt tcagaattta tctttcattc agttcccttc cattatcatt 1286
tacttttact tagaagtatc caaggaagtc ttttaacttt aatttccatt tcttcctaaa 1346
gggagagtga gtgatatgta cagtgttttg agattgtata catatattcc agaacttgga 1406
ggaaatctta tttaagttta tgaatataac catctgttac tgttctaaaa atgtttaaaa 1466
gaaactcaat acagataaag ataaatatgt gactattaaa aaaaaaa 1513



<210> 96 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS
<222> 327..416

<221> polyA_site
<222> 404..417

<400> 96 

tgttttgagg tgttggcatt cttcgctgat ttggctgttc ccaatgttta cattatttaa 60 
tcttgcaaaa atggttctgt gcacttggat gtgaaatgct gtccagtttt atttttttta 120 
tgttgttatc cttggatgta caaaaaattc agaaaatgat ctctgtagat attctgtttt 180
attttggtca tctttagaag ttatcaggaa tgtgtttaaa acaagaagag aacttttcta 240
aggaatgata catagaaaag attttatttt aaaatgagtt gtaaagcttg tgtttctttg 300
ttgctgcaag ctatctgccc aagtta atg caa atg gac aca ttt ttt atg tca 353
Met Gln Met Asp Thr Phe Phe Met Ser
1 5

gaa aaa cac aca cac aca cac aca cat ata cac aca cac aca cga aaa 401
Glu Lys His Thr His Thr His Thr His Ile His Thr His Thr Arg Lys
10 15 20 25

aca aaa aaa aaa aaa a 417
Thr Lys Lys Lys Lys
30



<210> 97
<211> 603
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..398

<221> sig_peptide
<222> 63..206
<223> Von Heijne matrix
score 4.9
seq PSLAAGLLFGSLA/GL

<400> 97

ggggccttcg tgagaccggt gcaggcctgg ggtagtctcc tgtctggaca gagaagagaa 60
aa atg cag gac act ggc tca gta gtg cct ttg cat tgg ttt ggc ttt 107
Met Gln Asp Thr Gly Ser Val Val Pro Leu His Trp Phe Gly Phe
-45 -40 -35

ggc tac gca gca ctg gtt gct tct ggt ggg atc att ggc tat gta aaa 155
Gly Tyr Ala Ala Leu Val Ala Ser Gly Gly Ile Ile Gly Tyr Val Lys
-30 -25 -20
gca


ggc 


aqc 


crtQ 


cccr 


tec 


ctg 


crct 


y 


yyy 


uy 




L. t U 


ggc 


agt 


cca 


203 


Ala 


Gly 


Ser 


Val 


Pro 


Ser 


Leu 


Ala 


Ala 


Gly 


XjC Li. 




xriie 


vjrXy 


Ser 


Leu 








-15 










-10 










_ c 
- O 








gcc 


ggc 


ctg 


QQ"t 


act 


tac 


cag 


ctcr 


tct 


cag 


y ct u- 




agg 


aac 


get 


tgg 


OCT 

251 


Ala 


Gly 
1 


Leu 


Gly 


Ala 


Tvr 
5 


Gin 


Leu 


Ser 






D 

XT X U 


Arg 


Asn 


Val 


Trp 
15 




gtt 


ttc 


eta 


QCt 


aca 


tct 




acc 




y t-- 


ggc 




acg 


gga 


acg 


agg 


299 


Val 


Phe 


Leu 


Ala 


Thr 


Ser 


Gly 


Thr 


JLiC LI 




vjj.y 


xxe 


i-ieu 


Cj±y 


Met 


Arg 












20 










z o 










30 




ttc tac cac tct gga aaa ttc atg cct gca ggt tta att gca ggt gcc 347
Phe Tyr His Ser Gly Lys Phe Met Pro Ala Gly Leu Ile Ala Gly Ala
35 40 45

agt ttg ctg atg gtc gcc aaa gtt gga gtt agt atg ttc aac aga ccc 395
Ser Leu Leu Met Val Ala Lys Val Gly Val Ser Met Phe Asn Arg Pro
50 55 60









cat tagcagaagt catgttccag cttagactga tgaagaatta aaaatctgca 448 
His 

tcttccacta ttttcaatat attaagagaa ataagtgcag catttttgca tctgacattt 508 

tacctaaaaa aaaagacacc aaacttggca gagaggtgga aaatcagtca tgattacaaa 568 

cctacagagg tggcgagtat gtaacacaag agctt 603 



<210> 98
<211> 522
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..163

<221> polyA_signal
<222> 488..493

<221> polyA_site
<222> 511..522

<400> 98

c gag att gcg ggc tat ggc gcc gaa ggt ttt tcg tca gta ctg gga tat 49
Glu Ile Ala Gly Tyr Gly Ala Glu Gly Phe Ser Ser Val Leu Gly Tyr
1 5 10 15

ccc cga tgg cac cga ttg cca ccg caa agc cta cag cac cac cag tat 97
Pro Arg Trp His Arg Leu Pro Pro Gln Ser Leu Gln His His Gln Tyr
20 25 30

tgc cag cgt cgc tgg cct gac cgc cgc tgc cta cag agt cac act caa 145
Cys Gln Arg Arg Trp Pro Asp Arg Arg Cys Leu Gln Ser His Thr Gln
35 40 45

tcc tcc ggg cac ctt cct nntgaaggag tggctaaggt tggacaatac 193
Ser Ser Gly His Leu Pro
50

acgttcactg cagctgctgt cggggccgtg tttggcctca ccacctgcat cagcgcccat 253
gtccgcgaga agcccgacga ccccctgaac tacttccccg gtggctgcgc cnggaggcct 313
gactctggga gcacgcacgc acaactacgg gattggcgcc gccgcctgcg tgtactttgg 373
catagcggcc tccctggtca agatgggccg gctggagggc tgggaggtgt ttgcaaaacc 433
caaggtgtga gccctgtgcc tgccgggacc tccagcctgc agaatgcgtc cagaaataaa 493
ttctgtgtct gtgtgtgaaa aaaaaaaaa 522
<210> 99
<211> 956
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 13..465

<221> sig_peptide
<222> 13..75
<223> Von Heijne matrix
score 3.9
seq PVAVTAAVAPVLS/IN

<400> 99

ngagtcggga aa atg gct gcg agt acn tcn atg gnc ccg gtg gct gtg acg 51
Met Ala Ala Ser Thr Ser Met Xaa Pro Val Ala Val Thr
-20 -15 -10

gcg gca gtg gcg cct gtc ctg tcc ata aac agc gat ttc tca gat ttg 99
Ala Ala Val Ala Pro Val Leu Ser Ile Asn Ser Asp Phe Ser Asp Leu
-5 1 5

huzgg gaa att aaa aag caa ctg ctg ett att gcg ggc ctt acc egg gag 147 
Arg Glu Ile Lys Lys Gln Leu Leu Leu Ile Ala Gly Leu Thr Arg Glu
10 15 20

ihfgg ggc eta eta cae agt age aaa tgg teg gcg gag ttg get ttc tct 195 

Arg Gly Leu Leu His Ser Ser Lys Trp Ser Ala Glu Leu Ala Phe Ser
25 30 35 40

ctc cct gca ttg cct cnt ggc cag ctg caa ccg cct ccg cct att aca 243
Leu Pro Ala Leu Pro Xaa Gly Gln Leu Gln Pro Pro Pro Pro Ile Thr
45 50 55

gag gaa gat gcc cag gat atg gat gcc tat acc ctg gcc aag gcc tac 291
Glu Glu Asp Ala Gln Asp Met Asp Ala Tyr Thr Leu Ala Lys Ala Tyr
60 65 70

ttt gac gtt aaa gag tat gat cgg gca gca cat ttc ctg cat ggc tgc 339
Phe Asp Val Lys Glu Tyr Asp Arg Ala Ala His Phe Leu His Gly Cys
75 80 85

aat agc aag aaa gcc tat ttt ctg tat atg tat tcc aga tat ctg gtg 387
Asn Ser Lys Lys Ala Tyr Phe Leu Tyr Met Tyr Ser Arg Tyr Leu Val
90 95 100

agg gcc att tta aaa tgt cat tct gcc ttt agt gaa aca tcc ata ttt 435
Arg Ala Ile Leu Lys Cys His Ser Ala Phe Ser Glu Thr Ser Ile Phe
105 110 115 120

aga acc aat gga aaa gtt aaa tct ttt aaa tagcttagca gtgggccact 485
Arg Thr Asn Gly Lys Val Lys Ser Phe Lys
125 130

gaatgaatgt actttataca tagcaataat aaaaaaaaga tatcataaat aaagttaaaa 545
aggatggtag agaagaaaat attcttagga atgactaaca ggataagtaa caacctgatt 605
atttatttac tttaggttat ataaggttct tcatgcctgt gaattaatat tattgtgtaa 665
gaattaagtt aaaaagcctg ggctgacttt taaatttata aattcattta tcatgtttat 725
agtatattta ttgtttttct ttcatggcta ttaaaaagta tgactgtaaa ggacaatgca 785
agnaaaccaa cttaatactg tattgaataa taagtacaat ttattatttt actttgaaac 845
attatgaatt tactttccta ctttttctta gttgttatct atataaattg attaaaaaaa 905
cattttatgt acntnncatt tcctagtaca ggttgagtat cccttatttg a 956
<210> 100
<211> 1041
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 20..703

<221> sig_peptide
<222> 20..94
<223> Von Heijne matrix
score 3.9
seq ATVGLLMLGVTLP/NS

<221> polyA_signal
<222> 1000..1005

<221> polyA_site
<222> 1023..1041

<400> 100

|:ik:agggtcctg catcctacc atg teg atg get gtg gaa acc ttt ggc ttc ttc 52 

Met Ser Met Ala Val Glu Thr Phe Gly Phe Phe
-25 -20 -15

atg gca act gtg ggg ctg ctg atg ctg ggg gtg act ctg cca aac agc 100
Met Ala Thr Val Gly Leu Leu Met Leu Gly Val Thr Leu Pro Asn Ser
-10 -5 1

tac tgg cga gtg tcc act gtg cac ggg aac gtc atc acc acc aac acc 148
Tyr Trp Arg Val Ser Thr Val His Gly Asn Val Ile Thr Thr Asn Thr
5 10 15

atc ttc gag aac ctc tgg ttt agc tgt gcc acc gac tcc ctg ggc gtc 196
Ile Phe Glu Asn Leu Trp Phe Ser Cys Ala Thr Asp Ser Leu Gly Val
20 25 30

tac aac tgc tgg gag ttc ccg tcc atg ctg gcc ctc tct ggg tat att 244
Tyr Asn Cys Trp Glu Phe Pro Ser Met Leu Ala Leu Ser Gly Tyr Ile
35 40 45 50

cag gcc tgc cgg gca ctc atg atc acc gcc atc ctc ctg ggc ttc ctc 292
Gln Ala Cys Arg Ala Leu Met Ile Thr Ala Ile Leu Leu Gly Phe Leu
55 60 65

ggc ctc ttg cta ggc ata gcg ggc ctg cgc tgc acc aac att ggg ggc 340
Gly Leu Leu Leu Gly Ile Ala Gly Leu Arg Cys Thr Asn Ile Gly Gly
70 75 80

ctg gag ctc tcc agg aaa gcc aag ctg gcg gcc acc gca ggg gcc ccc 388
Leu Glu Leu Ser Arg Lys Ala Lys Leu Ala Ala Thr Ala Gly Ala Pro
85 90 95

cac att ctg gcc ggt atc tgc ggg atg gtg gcc atc tcc tgg tac gcc 436
His Ile Leu Ala Gly Ile Cys Gly Met Val Ala Ile Ser Trp Tyr Ala
100 105 110

ttc aac atc acc cgg gac ttc ttc gac ccc ttg tac ccc gga acc aag 484
Phe Asn Ile Thr Arg Asp Phe Phe Asp Pro Leu Tyr Pro Gly Thr Lys
115 120 125 130

tac gag ctg ggc ccc gcc ctc tac ctg ggg tgg agc gcc tca ctg atc 532
Tyr Glu Leu Gly Pro Ala Leu Tyr Leu Gly Trp Ser Ala Ser Leu Ile
135 140 145




tec 


ate 


ctcr crCT't" crcrn 




uyc_ 


etc 


tgc 


tec 


gcc 


tgc 


tgc tgc 


ggc 


tct 


Ser 


lie 




Leu 




Leu. 


Cys 


Ser 


Ala 


Cys 


Cys Cys 


Gly Ser 






150 
















160 






crac 


cracr 


crac cca crcc 


gcc 


cty 


y ^ 


egg 


egg 


ccc 


tac 


cag get 


cca 


gtg 


Asp 


Glu 


AsD Piro Ala 


Ala 


OCX 




Arg 


Arg 


Pro 


Tyr 


Gin Ala 


Pro 


Val 






165 






170 










175 






tec 


gtg 


atg ccc gtc 


gcc 


acc 


teg 


gae 


caa 


gaa 


ggc 


gae age 


age 


ttt 


Ser 


Val 


Met Pro Val 


Ala 


Thr 


Ser 


Asp 


Gin 


Glu 


Gly 


Asp Ser 


Ser 


Phe 




180 






185 










190 








ggc 


aaa 


tac ggc aga 


aac 


gcc 


tac 


gtg 


tagcagetct ggcccgtggg 




Gly 


Lys 


Tyr Gly Arg 


Asn 


Ala 


Tyr 


Val 















195 200 

ccccgctgtc ttcccactgc cccaaggaga ggggacctgg ccggggccca ttcccctata 783
gtaacctcag gggccggcca cgccccgctc ccgtagcccc gccccggcca cggccccgtg 843
tcttgcactc tcatggcccc tccaggccaa gaactgctct tgggaagtcg catatctccc 903
ctctgaggct ggatccctca tcttctgacc ctgggttctg ggctgtgaag gggacggtgt 963
ccccgcacgt ttgtattgtg tataaataca ttcattaata aatgcatatt gtgaccgtta 1023
aaaaaaaaaa aaaaaaaa 1041



<210> 101
<211> 558
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 103..294

<221> sig_peptide
<222> 103..243
<223> Von Heijne matrix
score 5.9
seq TWLGLLSFQNLHC/FP

<400> 101

ttcccatggt ttagaagcat aacctgtaat gtaatgcaag tcccctaact ccctggttgc 60
taacattaac ttccttaagt aataatcaat gaaagaaatt ct atg cat ggt ttt 114
Met His Gly Phe
-45

gaa ata ata tcc ttg aaa gag gaa tca cca tta gga aag gtg agt cag 162
Glu Ile Ile Ser Leu Lys Glu Glu Ser Pro Leu Gly Lys Val Ser Gln
-40 -35 -30

ggt cct ttg ttt aat gtg act agt ggc tca tca tca cca gtg acc tgg 210
Gly Pro Leu Phe Asn Val Thr Ser Gly Ser Ser Ser Pro Val Thr Trp
-25 -20 -15

ttg ggc cta ctc tcc ttc cag aac ctg cat tgc ttc cca gac ctc ccc 258
Leu Gly Leu Leu Ser Phe Gln Asn Leu His Cys Phe Pro Asp Leu Pro
-10 -5 1 5

act gag atg cct cta aga gcc aaa gga gtc aac act tgagcctagg 304
Thr Glu Met Pro Leu Arg Ala Lys Gly Val Asn Thr
10 15

gtgggctaca acaaaagatt ctaatttacc ttgcttcatc taggtccagg ccccaagtag 364
cttgctgaag gaacttaaaa agtagctgtt atttattgta ttgtataagc taaaaacatt 424
tatttttgtt gaatcgaaac aattccatgt agcaatcttt tttctgttca cggtgtttgt 484






gatagaacct taaattccgc aagcatcagt tttttgaaaa aatgggaatt gaccggatag 544 
taacaggcaa agtt 558



<210> 102 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 81..518

<221> sig_peptide
<222> 81..173
<223> Von Heijne matrix
score 3.9
seq ILFHGVFYAGGFA/IV

<400> 102

ctcgtcatgc tctttgtagc gtggtgcttc tgttgctcac aggacaactt gcctttgatg 60
attttcaaga gagttgtgct atg atg tgg caa aag tat gca gga agc agg cgg 113
Met Met Trp Gln Lys Tyr Ala Gly Ser Arg Arg
-30 -25

tca atg cct ctg gga gca agg atc ctt ttc cac ggt gtg ttc tat gcc 161
Ser Met Pro Leu Gly Ala Arg Ile Leu Phe His Gly Val Phe Tyr Ala
-15 -10 -5






ggc 


ttt 


gcc 


att 


gtg 


tat 


tac 


etc 


att 


caa 


aag 


ttt 


cat 


tee 


agg 


209 




Gly 


Phe 


Ala 


He 
1 


Val 


Tyr 


Tyr 


Leu 
5 


He 


Gin 




Phe 


His 


Ser 


Arg 






tta 


tat 


tac 


aag 


ttg 


gca 


gtg 


gag 


cag 


ctg 


cag 


age 


10 
cat 


ecc 


gag 


257 


;^^la 


Leu 


Tyr 


Tyr 


Lys 


Leu 


Ala 


Val 


Glu 


Gin 


Leu 


Gin 


Ser 


His 


Pro 


Glu 








15 










20 










25 










^%ca 


cag 


gaa 


get 


ctg 


ggc 


cct 


cct 


etc 


aac 


ate 


cat 


tat 


etc 


aag 


etc 


305 




Gin 


Glu 


Ala 


Leu 


Gly 


Pro 


Pro 


Leu 


Asn 


He 


His 


Tyr 


Leu 


Lys 


Leu 






30 










35 










40 










atc gac agg gaa aac ttc gtg gac att gtt gat gcc aag ttg aag att 353
Ile Asp Arg Glu Asn Phe Val Asp Ile Val Asp Ala Lys Leu Lys Ile
45 50 55 60

cct gtc tct gga tcc aaa tca gag ggc ctt ctc tac gtc cac tca tcc 401
Pro Val Ser Gly Ser Lys Ser Glu Gly Leu Leu Tyr Val His Ser Ser
65 70 75

aga ggt ggc ccc ttt cag agg tgg cac ctt gac gag gtc ttt tta gag 449
Arg Gly Gly Pro Phe Gln Arg Trp His Leu Asp Glu Val Phe Leu Glu
80 85 90

ctc aag gat ggt cag cag att cct gtg ttc aag ctc agt ggg gaa aac 497
Leu Lys Asp Gly Gln Gln Ile Pro Val Phe Lys Leu Ser Gly Glu Asn
95 100 105

ggt gat gaa gtg aaa aag gag tagagacgac ccagaagacc cagcttgctt 548
Gly Asp Glu Val Lys Lys Glu
110 115

ctagtccatc cttccctcat ctctaccata tggccactgg ggtggtggcc catctcagtg 608
acagacactc ctgcaaccca gttttccagc caccagtggg atgatggtat gtgccagcac 668
atggtaattt tggtgtaatt ctaacttggg cacaacgaat gctatttgtc atttttaaac 728
<210> 103
<211> 1098
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 66..326

<221> polyA_signal
<222> 1066..1071

<221> polyA_site
<222> 1087..1098

<400> 103

ctccctttga atgagagaaa ctaacccgct tccgaagccc ctgaaagaca ctgctccttc 60 
ctctc atg gag ttg gct ccg aca gcc cgt ctg cca cca ggc cat ggt tcc 110
Met Glu Leu Ala Pro Thr Ala Arg Leu Pro Pro Gly His Gly Ser
1 5 10 15

ttg ccc cat ggt gtc ctg gga ccc aga gca aca gga tct gtc acc cac 158
Leu Pro His Gly Val Leu Gly Pro Arg Ala Thr Gly Ser Val Thr His
20 25 30

ctc tct ctt ctc ccc cag atc aag caa cgt gcc tca gag gct ttg ccc 206
Leu Ser Leu Leu Pro Gln Ile Lys Gln Arg Ala Ser Glu Ala Leu Pro
35 40 45

gaa ttg ctt cgt cct gtc acc ccc atc acc aat ttt gag ggc agc cag 254
Glu Leu Leu Arg Pro Val Thr Pro Ile Thr Asn Phe Glu Gly Ser Gln
50 55 60

tct cag gac cac agt gga atc ttt ggc ctg gta aca aac ctg gaa gag 302
Ser Gln Asp His Ser Gly Ile Phe Gly Leu Val Thr Asn Leu Glu Glu
65 70 75

,,^tg gag gtg gac gat tgg gag ttc tgagcetctg caaactgtgc gcattcteca 356 
Leu Glu Val Asp Asp Trp Glu Phe 
80 85
;;feccagggatg cagaggecac ceagaggcec ttcetgaggg ccggccacat teecgecctc 416 
'^titgggcagat tgggtagaaa ggacattett ecaggaaagt tgactgctgg ctgattggga 476 
aagaaaatcc tggagagata cttcactgct ccaaggcttt tgagacacaa gggaatctca 536
acaaccaggg atcaggaggg tccaaagccg acattcccag tcctgtgagc tcaggtgacc 596
tcctccgcag aagagagatg ctgctctggc cctgggagct gaattccaag cccagggttt 656
ggctccttaa acccgaggac cgccacctct tcccagtgct tgcgaccagc ctcattctac 716
ttaactttgc tctcagatgc ctcagatgct ataggtcagt gaaagggcga gtagtaagct 776
gcctgcctcc cttccctcag acctctccct cataattcca gagaagggca tttctgtctt 836
tttaagcaca gactaaggct ggaacagtcc atccttatcc ctcttctggc ttgggccctg 896
acacctaagt ctttcccacg gtttatgtgt gtgcctcatt cctttcccac caagaatcca 956
tcttagcgcc tcctgccagc tgccctggtg ctttctccaa gggccatcag tgtcttgcct 1016
agcttgaggg cttaagtcct tatgctgtgt tagtttcgtt gtcagaacaa attaaaattt 1076
tcagagacgc aaaaaaaaaa aa 1098 



<210> 104 
<211> 346 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 170..289

<221> sig_peptide
<222> 170..250
<223> Von Heijne matrix
score 3.6
seq LTLLLITPSPSPL/LF

<400> 104

ccatttgagc cccaccacgg aggttatgtg gtcccaaaag gaatgatggc caagcaatta 60 
atttttcctc ctagttctta gcttgcttct gcattgattg gctttacaca actggcattt 120 
agtctgcatt acacaaatag acactaattt atttggaaca agcagcaaa atg aga act 178 

Met Arg Thr 
-25 

tta ttt ggt gca gtc agg gct cca ttt agt tcc ctc act ctg ctt cta 226
Leu Phe Gly Ala Val Arg Ala Pro Phe Ser Ser Leu Thr Leu Leu Leu
-20 -15 -10

atc acc cct tct ccc agc cct ctt cta ttt gat aga ggt ctg tcc ctc 274
Ile Thr Pro Ser Pro Ser Pro Leu Leu Phe Asp Arg Gly Leu Ser Leu
-5 1 5

aga tca gca atg tct tagcccctct cctctcttcc attccttcct gttggtactc 329
Arg Ser Ala Met Ser
10

atttcttcta actttta 346



<210> 105
<211> 685
<212> DNA
<213> Homo sapiens

<221> CDS
<222> 36..497

<221> polyA_signal
<222> 650..655

<221> polyA_site
<222> 663..685

<400> 105

aagttctgcg ctggtcggcg gagtagcaag tggcc atg ggg agc ctc agc ggt 53
Met Gly Ser Leu Ser Gly
1 5

ctg cgc ctg gca gca gga agc tgt ttt agg tta tgt gaa aga gat gtt 101
Leu Arg Leu Ala Ala Gly Ser Cys Phe Arg Leu Cys Glu Arg Asp Val
10 15 20

tcc tca tct cta agg ctt acc aga agc tct gat ttg aag aga ata aat 149
Ser Ser Ser Leu Arg Leu Thr Arg Ser Ser Asp Leu Lys Arg Ile Asn
25 30 35

gga ttt tgc aca aaa cca cag gaa agt ccc gga gct cca tcc cgc act 197
Gly Phe Cys Thr Lys Pro Gln Glu Ser Pro Gly Ala Pro Ser Arg Thr
40 45 50
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Leu 
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Se2r 
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vjIU 
D n 


Asp 
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gtc 
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^ L« U 
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gea 


o U 

aag 


aae 


aag 
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Val 


SejT 


Leu 


Glu 


1*1 d L> 
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A J. a 


Aia 
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90 




















100 


acre 


age 


tat 


eta 


atg 


att 






acg 


gtg 


gua 


gga 


tgc 


ate 


SeiT 




lyr 


Leu 




±xe 


A_La 


Leu 


Thr 


Val 


Val 


Gly 


Cys 


He 






1 rm 

X VJ o 










110 








115 




gtt 


att 


gag 


ggc 


aag 


aag 


get 


gee 


caa 


aga 


cae 


gag 


act 


tta 


Val 


He 


Glu 


Gly 


Lys 


Lys 


Ala 


Ala 


Gin 


Arg 


His 


Glu 


Thr 


Leu 




120 










125 










130 






ttg 


aac 


tta 


gaa 


aag 


aaa 


get 


egt 


etg 


aaa 


gag 


gaa 


gea 


get 


Leu 


Asn 


Leu 


Glu 


Lys 


Lys 


Ala 


Arg 


Leu 


Lys 


Glu 


Glu 


Ala 


Ala 


135 










140 










145 









70 



85 



245 



293 



341 



389 



age 437 



485 



150 

gcc aaa aca gag tagcagaggt atccgtgttg gctggatttt gaaaatccag 537
Ala Lys Thr Glu

gaattatgtt ataacgtgcc tgtattaaaa aggatgtggt atgaggatcc atttcataaa 597

=^j^tatgatttg cecaaaeetg taccatttec gtatttetgc cgtagaagta gaaataaatt 657 

ttcttaaaaa aaaaaaaaaa aaaaaaaa 685



<210> 106
<211> 554
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 18..320

<221> polyA_signal
<222> 539..544

<221> polyA_site
<222> 542..554

<400> 106

aaccgtcgtg gggaagg atg gtg tgc gaa aaa tgt gaa aag aaa ctt ggt 50
Met Val Cys Glu Lys Cys Glu Lys Lys Leu Gly
1 5 10

act gtt atc act cca gat aca tgg aaa gat ggt gct agg aat acc aca 98
Thr Val Ile Thr Pro Asp Thr Trp Lys Asp Gly Ala Arg Asn Thr Thr
15 20 25

gaa agt ggt gga aga aag ctg aat aaa aat aaa gct ttg act tca aaa 146
Glu Ser Gly Gly Arg Lys Leu Asn Lys Asn Lys Ala Leu Thr Ser Lys
30 35 40

aaa gca aga ttt gat cca tat gga aag aat aag ttc tcc act tgt aga 194
Lys Ala Arg Phe Asp Pro Tyr Gly Lys Asn Lys Phe Ser Thr Cys Arg
45 50 55

att tgt aaa agt tct gtg cac caa cca ggt tct cat tac tgc cag ggc 242
Ile Cys Lys Ser Ser Val His Gln Pro Gly Ser His Tyr Cys Gln Gly
60 65 70 75

tgt gcc tac aaa aaa ggc atc tgt gcg atg tgt ggn aaa aaa gtt ttg 290
Cys Ala Tyr Lys Lys Gly Ile Cys Ala Met Cys Gly Lys Lys Val Leu
80 85 90

gat acc aaa aac tac aag caa aca tct gtc tagatgtatt gatggaattt 340
Asp Thr Lys Asn Tyr Lys Gln Thr Ser Val
95 100

ctggctttct aaatgatttt actttctgcc ttgaattttc aaggcataga tgtcaactta 400
cagaataaca tgttttaaga taattaagtt taaaccagag aatttgattg ttactcattt 460
tgctctcatg ttctaaacag caacagtgta actagtcttt tgttgtaaat ggttattttc 520
cttataagaa ttttaagaac taaaaaaaaa aaaa 554



<210> 107 

<211> 1678 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 71..1438

<221> sig_peptide
<222> 71..136
<223> Von Heijne matrix
score 3.5
seq AAPVAAGLGPVIS/RP

<221> polyA_signal
<222> 1644..1649

<221> polyA_site
<222> 1665..1678

<400> 107

ccgacttcca gaggagcgct gtgcacgtgg agaagagcgg ggactcggcg accctgccct 60
cccgaccctc atg ttc gaa gag act gag tgg gcc gag gcg gcc cca gta 109
Met Phe Glu Glu Pro Glu Trp Ala Glu Ala Ala Pro Val
-20 -15 -10

gcc gcg ggc ctt ggg ccc gta atc tca cga cct ccg cct gcg gcc tcc 157
Ala Ala Gly Leu Gly Pro Val Ile Ser Arg Pro Pro Pro Ala Ala Ser
-5 1 5

tcg caa aac aag ggc tcc aag cgc cgc cag ctc ttg gcc aca tta cgg 205
Ser Gln Asn Lys Gly Ser Lys Arg Arg Gln Leu Leu Ala Thr Leu Arg
10 15 20

gcc cta gag gca gca tct ctt tcc cag cat ccc ccc agc cta tgt ata 253
Ala Leu Glu Ala Ala Ser Leu Ser Gln His Pro Pro Ser Leu Cys Ile
25 30 35

agt gac tct gag gag gag gag gag gaa agg aag aag aaa tgc ccc aaa 301
Ser Asp Ser Glu Glu Glu Glu Glu Glu Arg Lys Lys Lys Cys Pro Lys
40 45 50 55

aag gca tca ttt gcc agt gcc tct gct gaa gta ggg aag aaa ggg aag 349
Lys Ala Ser Phe Ala Ser Ala Ser Ala Glu Val Gly Lys Lys Gly Lys
60 65 70

aag aaa tgt caa aaa cag ggc cca cct tgc agt gac tct gag gaa gaa 397
Lys Lys Cys Gln Lys Gln Gly Pro Pro Cys Ser Asp Ser Glu Glu Glu
75 80 85

gta gaa agg aag aag aaa tgc cac aaa cag gct ctt gtt ggc agt gac 445
Val Glu Arg Lys Lys Lys Cys His Lys Gln Ala Leu Val Gly Ser Asp
90 95 100

tct gct gaa gat gag aaa aga aag agg aaa tgc cag aaa cat gcc cct 493
Ser Ala Glu Asp Glu Lys Arg Lys Arg Lys Cys Gln Lys His Ala Pro
105 110 115

ata aat tca gcc cag cac ctg gac aat gtt gac caa aca ggt ccc aaa 541
Ile Asn Ser Ala Gln His Leu Asp Asn Val Asp Gln Thr Gly Pro Lys
120 125 130 135

gcc tgg aag ggt agt act aca aat gat cca cca aag caa agc cct ggg 589
Ala Trp Lys Gly Ser Thr Thr Asn Asp Pro Pro Lys Gln Ser Pro Gly
140 145 150

tcc act tcc cct aaa ccc cct cat aca tta agc cgc aag cag tgg cgg 637
Ser Thr Ser Pro Lys Pro Pro His Thr Leu Ser Arg Lys Gln Trp Arg
155 160 165

aac cgg caa aag aat aag aga aga tgt aag aac aag ttt cag cca cct 685
Asn Arg Gln Lys Asn Lys Arg Arg Cys Lys Asn Lys Phe Gln Pro Pro
170 175 180

cag gtg cca gac cag gcc cca gct gag gcc ccc aca gag aag aca gag 733
Gln Val Pro Asp Gln Ala Pro Ala Glu Ala Pro Thr Glu Lys Thr Glu
185 190 195

gtg tct cct gtt ccc agg aca gac agc cat ggg gct cgg gca ggg gct 781
Val Ser Pro Val Pro Arg Thr Asp Ser His Gly Ala Arg Ala Gly Ala
200 205 210 215

ttg cga gcc cgc atg gca cag cgg ctg gat ggg gcc cga ttt cgc tac 829
Leu Arg Ala Arg Met Ala Gln Arg Leu Asp Gly Ala Arg Phe Arg Tyr
220 225 230

ctc aat gaa cag ttg tac tca ggg ccc agc agt gct gca cag cgt ctc 877
Leu Asn Glu Gln Leu Tyr Ser Gly Pro Ser Ser Ala Ala Gln Arg Leu
235 240 245

ttc cag gaa gac cct gag gct ttt ctt ctc tac cac cgc ggc ttc cag 925
Phe Gln Glu Asp Pro Glu Ala Phe Leu Leu Tyr His Arg Gly Phe Gln
250 255 260

agc caa gtg aag aag tgg cca ctg cag cca gtg gac cgc atc gcc agg 973
Ser Gln Val Lys Lys Trp Pro Leu Gln Pro Val Asp Arg Ile Ala Arg
265 270 275

gat ctt cgc cag cgg cct gca tcc cta gtg gtg gct gac ttc ggc tgt 1021
Asp Leu Arg Gln Arg Pro Ala Ser Leu Val Val Ala Asp Phe Gly Cys
280 285 290 295

ggg gat tgc cgc ttg gct tca agt atc cgg aac cct gtg cat tgc ttt 1069
Gly Asp Cys Arg Leu Ala Ser Ser Ile Arg Asn Pro Val His Cys Phe
300 305 310

gac ttg gct tct ctg gac cct agg gtc act gtg tgt gac atg gcc cag 1117
Asp Leu Ala Ser Leu Asp Pro Arg Val Thr Val Cys Asp Met Ala Gln
315 320 325

gtt cct ttg gag gat gag tct gtg gat gtg gct gtg ttt tgc ctt tca 1165
Val Pro Leu Glu Asp Glu Ser Val Asp Val Ala Val Phe Cys Leu Ser
330 335 340

ctg atg gga acc aac atc agg gac ttc cta gag gag gca aat aga gta 1213
Leu Met Gly Thr Asn Ile Arg Asp Phe Leu Glu Glu Ala Asn Arg Val
345 350 355

ctg aag cca ggg ggt ctc ctg aaa gtg gct gag gtc agc agc cgc ttt 1261
Leu Lys Pro Gly Gly Leu Leu Lys Val Ala Glu Val Ser Ser Arg Phe
360 365 370 375

gag gat gtt cga acc ttt ctg cgg gct gtg acc aag cta ggc ttc aag 1309
Glu Asp Val Arg Thr Phe Leu Arg Ala Val Thr Lys Leu Gly Phe Lys
380 385 390

att gtc tcc aag gac ctg acc aac agc cat ttc ttc ttg ttt gat ttc 1357
Ile Val Ser Lys Asp Leu Thr Asn Ser His Phe Phe Leu Phe Asp Phe
395 400 405

caa aag act ggg ccc cct ctg gta ggg ccc aag gct cag ctt tca ggc 1405
Gln Lys Thr Gly Pro Pro Leu Val Gly Pro Lys Ala Gln Leu Ser Gly
410 415 420

ctg cag ctt cag cca tgt ctc tac aag cgc agg tgacctctgg atcttccttg 1458
Leu Gln Leu Gln Pro Cys Leu Tyr Lys Arg Arg
425 430

agaggggagg cagatctcaa actccaggct cagaactgtg aagactgttt ccggcctggc 1518
tgtgagccaa gacctggttc ctggtggacc ctgaggacaa agtgtgataa aacctctggc 1578
tcagacttgc tctactgaag gcttcttggt tataagatgc ataaagtcac tggggctagc 1638
taaacaataa agagtttatt gtgaggaaaa aaaaaaaaaa 1678



<210> 108 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 25..318

<221> sig_peptide
<222> 25..75
<223> Von Heijne matrix
score 7.4
seq FFLLLQFFLRIDG/VL

<221> polyA_signal
<222> 452..457

<221> polyA_site
<222> 482..494

<400> 108

aggctgagtg tgaagattag agta atg cct tct agc ttt ttc ctg ctg ttg 51
Met Pro Ser Ser Phe Phe Leu Leu Leu
-15 -10

cag ttt ttc ttg aga att gat ggg gtg ctt atc aga atg aat gac acg 99
Gln Phe Phe Leu Arg Ile Asp Gly Val Leu Ile Arg Met Asn Asp Thr
-5 1 5

aga ctt tac cat gag gct gac aag acc tac atg tta cga gaa tat acg 147
Arg Leu Tyr His Glu Ala Asp Lys Thr Tyr Met Leu Arg Glu Tyr Thr
10 15 20

tca cga gaa agc aaa att tct agt ttg atg cat gtt cca cct tcc ctc 195
Ser Arg Glu Ser Lys Ile Ser Ser Leu Met His Val Pro Pro Ser Leu
25 30 35 40

ttc acg gaa cct aat gaa ata tcc cag tat tta cca ata aag gaa gca 243
Phe Thr Glu Pro Asn Glu Ile Ser Gln Tyr Leu Pro Ile Lys Glu Ala
45 50 55

gtt tgt gag aag cta ata ttt cca gaa aga att gat cct aac cca gca 291
Val Cys Glu Lys Leu Ile Phe Pro Glu Arg Ile Asp Pro Asn Pro Ala
60 65 70

gac tca caa aaa agt aca caa gtg gaa taaaatgtga tacaacatat 338
Asp Ser Gln Lys Ser Thr Gln Val Glu
75 80

actcactatg gaatctgact ggacaccttg gctatttgta aggggttatt tttattatga 398
gaattaattg ccttgtttat gtacagattt tctgtagcct taaaggaaaa aaaaataaag 458
atcgttacag gcaggtttca ctcaaaaaaa aaaaac 494



<210> 109 
<211> 714 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 84..332

<221> sig_peptide
<222> 84..170
<223> Von Heijne matrix
score 5.2
seq PCYYLGLFQRALA/SV

<221> polyA_site
<222> 702..714

<400> 109

;.pctatctctt ctgctggctg ggctcaatgc cgcgggtgag cgttcggccg aggctgctcc 60 
j|:acccttgag tgatgtgcct tga atg acg ctg ctt tea ttc get get ttc acg 113 

Met Thr Leu Leu Ser Phe Ala Ala Phe Thr 
-25 -20

;5ct get ttc tec gtc etc ccc tgt tac tac ctt ggg ctg ttt cag egg 161 
ijvia Ala Phe Ser Val Leu Pro Cys Tyr Tyr Leu Gly Leu Phe Gin Arg 

-15 -10 -5 

gcg ctc gcg tcg gtc ttc gac cca ctt tgc gtt tgt tca cgt gtg ctc 209
Ala Leu Ala Ser Val Phe Asp Pro Leu Cys Val Cys Ser Arg Val Leu
1 5 10

ccg aca cct gta tgt acc ttg gtc gca aca caa gcc gaa aaa ata tta 257
Pro Thr Pro Val Cys Thr Leu Val Ala Thr Gln Ala Glu Lys Ile Leu
15 20 25

gag aat ggg ccc tgt cca acc aag gag gcg gcc cag ctt gtc ggg aag 305
Glu Asn Gly Pro Cys Pro Thr Lys Glu Ala Ala Gln Leu Val Gly Lys
30 35 40 45

ggc agc gtt tcc gcc aga aat gct tcg tgaaaggcac ttgagggacc 352
Gly Ser Val Ser Ala Arg Asn Ala Ser
50

ttagcagcat cctcaacagg ccttgtaggg aatgccagaa gaagcagtcc ttggccgggc 412
ggggtggctc atgcctgtgg tcccagcact ttgggaggcc ggggcgggcg gatcacctga 472
ggtcgggagg tccagaccag cctgaccgac atggagaaac cccgtctnta ctagaaatac 532
aaaactagcc gggtgtggtg gcgcatgcct gtagtcccag ctactcggga gggtgaggca 592
ggagacgttc ttgaacccgg gaggcggagt ttgtggtgag ccgagatcgc gccattgcac 652
tccagcctgg gcatgccaag agcgaaactc cgtctcaaaa aaaaaaaaga aaaaaaaaaa 712

714 
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<210> 110 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .718 



<221> sig_peptide 
<222> 32 . .100 
<223> Von Heijne matrix 
score 7.4 

seq VLLLAALPPVLLP/GA 

<221> polyA_signal 
<222> 770. .775 



<221> polyA_site 
<222> 793 . . 805 



.^^^<400> 110 

cctctttcag cccgggatcg ccccagcagg g atg ggc gac aag atc tgg ctg 52
Met Gly Asp Lys Ile Trp Leu
-20

ccc ttc ccc gtg ctc ctt ctg gcc gct ctg cct ccg gtg ctg ctg cct 100
Pro Phe Pro Val Leu Leu Leu Ala Ala Leu Pro Pro Val Leu Leu Pro
-15 -10 -5

ggg gcg gcc ggc ttc aca cct tcc ctc gat agc gac ttc acc ttt acc 148
Gly Ala Ala Gly Phe Thr Pro Ser Leu Asp Ser Asp Phe Thr Phe Thr
1 5 10 15

ctt ccc gcc ggc cag aag gag tgc ttc tac cag ccc atg ccc ctg aag 196
Leu Pro Ala Gly Gln Lys Glu Cys Phe Tyr Gln Pro Met Pro Leu Lys
20 25 30

gcc tcg ctg gag atc gag tac caa gtt tta gat gga gca gga tta gat 244
Ala Ser Leu Glu Ile Glu Tyr Gln Val Leu Asp Gly Ala Gly Leu Asp
35 40 45

att gat ttc cat ctt gcc tct cca gaa ggc aaa acc tta gtt ttt gaa 292
Ile Asp Phe His Leu Ala Ser Pro Glu Gly Lys Thr Leu Val Phe Glu
50 55 60

caa aga aaa tca gat gga gtt cac act gta gag act gaa gtt ggt gat 340
Gln Arg Lys Ser Asp Gly Val His Thr Val Glu Thr Glu Val Gly Asp
65 70 75 80

tac atg ttc tgc ttt gac aat aca ttc agc acc att tct gag aag gtg 388
Tyr Met Phe Cys Phe Asp Asn Thr Phe Ser Thr Ile Ser Glu Lys Val
85 90 95

att ttc ttt gaa tta atc ctg gat aat atg gga gaa cag gca caa gaa 436
Ile Phe Phe Glu Leu Ile Leu Asp Asn Met Gly Glu Gln Ala Gln Glu
100 105 110

caa gaa gat tgg aag aaa tat att act ggc aca gat ata ttg gat atg 484
Gln Glu Asp Trp Lys Lys Tyr Ile Thr Gly Thr Asp Ile Leu Asp Met
115 120 125

aaa ctg gaa gac atc ctg gaa tcc atc agc agc atc aag tcc aga cta 532
Lys Leu Glu Asp Ile Leu Glu Ser Ile Ser Ser Ile Lys Ser Arg Leu
130 135 140

agc aaa agt ggg cac ata caa att ctg ctt aga gca ttt gaa gct cgt 580
Ser Lys Ser Gly His Ile Gln Ile Leu Leu Arg Ala Phe Glu Ala Arg
150 155 160

gat cga aac ata caa gaa agc aac ttt gat aga gtc aat ttc tgg tct 628
Asp Arg Asn Ile Gln Glu Ser Asn Phe Asp Arg Val Asn Phe Trp Ser
165 170 175

atg gtt aat tta gtg gtc atg gtg gtg gtg tca gcc att caa gtt tat 676
Met Val Asn Leu Val Val Met Val Val Val Ser Ala Ile Gln Val Tyr
180 185 190

atg ctg aag agt ctg ttt gaa gat aag agg aaa agt aga act 718
Met Leu Lys Ser Leu Phe Glu Asp Lys Arg Lys Ser Arg Thr
195 200 205

taaaactcca aactagagta cgtaacattg aaaaatgagg cataaaaatg caataaactg 778
ttacagtcaa gaccaaaaaa aaaaaaa 805



<210> 111 
<211> 787 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 26..481

<221> sig_peptide
<222> 26..88
<223> Von Heijne matrix
score 4.4

seq AVASSFFCASLFS/AV

<221> polyA_signal
<222> 755..760

<221> polyA_site
<222> 775..787

<400> 111

gacagcctgg ataaaggctc acttg atg gct cag ttg gga gca gtt gtg gct 52
Met Ala Gln Leu Gly Ala Val Val Ala
-20 -15

gtg gct tcc agt ttc ttt tgt gca tct ctc ttc tca gct gtg cac aag 100
Val Ala Ser Ser Phe Phe Cys Ala Ser Leu Phe Ser Ala Val His Lys
-10 -5 1

ata gaa gag gga cat att ggg gta tat tac aga ggc ggt gcc ctg ctg 148
Ile Glu Glu Gly His Ile Gly Val Tyr Tyr Arg Gly Gly Ala Leu Leu
5 10 15 20

act tcg acc agc ggc cct ggt ttc cat ctc atg ctc cct ttc atc aca 196
Thr Ser Thr Ser Gly Pro Gly Phe His Leu Met Leu Pro Phe Ile Thr
25 30 35

tca tat aag tct gtg cag acc aca ctc cag aca gat gag gtg aag aat 244
Ser Tyr Lys Ser Val Gln Thr Thr Leu Gln Thr Asp Glu Val Lys Asn
40 45 50

gta cct tgt ggg act agt ggt ggt gtg atg atc tac ttt gac aga att 292
Val Pro Cys Gly Thr Ser Gly Gly Val Met Ile Tyr Phe Asp Arg Ile
55 60 65

gaa gtg gtg aac ttc ctg gtc ccg aac gca gtg cat gat ata gtg aag 340
Glu Val Val Asn Phe Leu Val Pro Asn Ala Val His Asp Ile Val Lys
70 75 80

aac tat act gct gac tat gac aag gcc ctc atc ttc aac aag atc cac 388
Asn Tyr Thr Ala Asp Tyr Asp Lys Ala Leu Ile Phe Asn Lys Ile His
85 90 95 100

cac gaa ctg aac cag ttc tgc agt gtg cac acg ctt caa gag gtc tac 436
His Glu Leu Asn Gln Phe Cys Ser Val His Thr Leu Gln Glu Val Tyr
105 110 115

att gag ctg ttt gga ctg gaa aat gat ttt tcc cag gaa tct tca 481
Ile Glu Leu Phe Gly Leu Glu Asn Asp Phe Ser Gln Glu Ser Ser
120 125 130

taaaagggac cctgagcaag aacatttttc atagcagaca ggaggactca tccacatcgc 541
cagcaatcat aattaagcaa accgcctttt gcaccattta agatttagga aatcatccaa 601
attactttta atgtttctgc agtagaaaat gaatctaaat tcattttata gggtttgtag 661
tcttttatct gttttggatt cactgtgctt ttaagaaaaa gttggtaaat ttgccgttga 721
tttttctttt taacctcaaa ctaatagaat tttataaaat attaattttc tccaaaaaaa 781
aaaaaa 787



<210> 112 
<211> 569 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 26..562

<221> sig_peptide
<222> 26..187
<223> Von Heijne matrix
score 4.1

seq AVVAAAARTGSEA/RV

<400> 112

=^,^kgaaacaggt ctgggctaca aaagt atg gcc get tct gag gcg gcg gtg gtg 52 
Met Ala Ala Ser Glu Ala Ala Val Val
-50

tct tcg ccg tct ttg aaa aca gac aca tcc cct gtc ctt gaa act gca 100
Ser Ser Pro Ser Leu Lys Thr Asp Thr Ser Pro Val Leu Glu Thr Ala
-45 -40 -35 -30

gga acg gtc gca gca atg gct gcg acc ccg tca gca agg gct gca gcc 148
Gly Thr Val Ala Ala Met Ala Ala Thr Pro Ser Ala Arg Ala Ala Ala
-25 -20 -15

gcg gtg gtt gcg gcc gcg gcc agg acc gga tcc gaa gcc agg gtc tcc 196
Ala Val Val Ala Ala Ala Ala Arg Thr Gly Ser Glu Ala Arg Val Ser
-10 -5 1

aag gcc gct ttg gct acc aag ctg ctg tcc ttg agc ggc gtg ttc gcc 244
Lys Ala Ala Leu Ala Thr Lys Leu Leu Ser Leu Ser Gly Val Phe Ala
5 10 15

gtg cac aag ccc aaa ggg ccc act tca gcc gag ctg ctg aat cgg ttg 292
Val His Lys Pro Lys Gly Pro Thr Ser Ala Glu Leu Leu Asn Arg Leu
20 25 30 35

aag gag aag ctg ctg gca gaa gct gga atg cct tct cca gaa tgg acc 340
Lys Glu Lys Leu Leu Ala Glu Ala Gly Met Pro Ser Pro Glu Trp Thr
40 45 50

aag agg aaa aag cag act ttg aaa att ggg cat gga ggg act cta gac 388
Lys Arg Lys Lys Gln Thr Leu Lys Ile Gly His Gly Gly Thr Leu Asp
55 60 65

agc gca gcc cga gga gtt ctg gtt gtt gga att gga agc gga aca aaa 436
Ser Ala Ala Arg Gly Val Leu Val Val Gly Ile Gly Ser Gly Thr Lys
70 75 80

atg ttg acc agt atg ttg tca ggg tcc aag agg tat act gcc att gga 484
Met Leu Thr Ser Met Leu Ser Gly Ser Lys Arg Tyr Thr Ala Ile Gly
85 90 95

gaa ctg ggg aaa gct act gat aca cta gat tct acg ggg aag gta aca 532
Glu Leu Gly Lys Ala Thr Asp Thr Leu Asp Ser Thr Gly Lys Val Thr
100 105 110 115

gaa gaa aaa cct tac ggt atg aac ctc atc taagtag 569
Glu Glu Lys Pro Tyr Gly Met Asn Leu Ile
120 125



<210> 113 
<211> 893 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 4..810

<221> sig_peptide
<222> 4..279
<223> Von Heijne matrix
score 6.8

seq AVMLYTWRSCSRA/IP

<221> polyA_signal
<222> 858..863

<221> polyA_site
<222> 881..893

<400> 113

gee atg ate aeg eac gte acc ctg gaa gat gee ctg tec aac gtg gae 48 
Met He Thr His Val Thr Leu Glu Asp Ala Leu Ser Asn Val Asp 
-90 -85 -80 

ctg ctt gaa gag ett cee etc ecc gae cag cag eca tgc ate gag cet 96 
Leu Leu Glu Glu Leu Pro Leu Pro Asp Gin Gin Pro Cys He Glu Pro 

-75 -70 -65 

eca cet tec tec ate atg tac cag get aae ttt gae aca aac ttt gag 144 
Pro Pro Ser Ser He Met Tyr Gin Ala Asn Phe Asp Thr Asn Phe Glu 

-60 -55 -50

gae agg aat gca ttt gte aeg gge att gca agg tac att gag cag get 192 
Asp Arg Asn Ala Phe Val Thr Gly He Ala Arg Tyr He Glu Gin Ala 

-40 -35 -30 

aea gte eac tee age atg aat gag atg ctg gag gaa gga cat gag tat 240 
Thr Val His Ser Ser Met Asn Glu Met Leu Glu Glu Gly His Glu Tyr 

-25 -20 -15 

gcg gte atg ctg tac acc tgg cge age tgt tec egg gcc att ecc cag 288 
Ala Val Met Leu Tyr Thr Trp Arg Ser Cys Ser Arg Ala He Pro Gin 
-10 -5 1 
gtg aaa tgc aac gag cag ccc aac cga gta gag atc tat gag aag aca 336
Val Lys Cys Asn Glu Gin Pro Asn Arg Val Glu He Tyr Glu Lys Thr 

5 10 15 

gta gag gtg ctg gag ccg gag gtc acc aag etc atg aag ttc atg tat 384 
Val Glu Val Leu Glu Pro Glu Val Thr Lys Leu Met Lys Phe Met Tyr 
20 25 30 35 

ttt cag cgc aag gee ate gag egg ttc tgc age gag gtg aag egg ctg 432 
Phe Gin Arg Lys Ala He Glu Arg Phe Cys Ser Glu Val Lys Arg Leu 

40 45 50 

tgc cat gcc gag cgc agg aag gac ttt gtc tct gag gee tac etc ctg 480 
Cys His Ala Glu Arg Arg Lys Asp Phe Val Ser Glu Ala Tyr Leu Leu 

55 60 65 

acc ett gge aag ttc ate aac atg ttt get gtc ctg gat gag eta aag 528 
Thr Leu Gly Lys Phe He Asn Met Phe Ala Val Leu Asp Glu Leu Lys 

70 75 80 

aac atg aag tgc age gtc aag aat gac cac tec gee tac aag agg gea 576 
Asn Met Lys Cys Ser Val Lys Asn Asp His Ser Ala Tyr Lys Arg Ala 

85 90 95 

gea cag ttc ctg egg aag atg gea gat ccc cag tct ate cag gag teg 624 
Ala Gin Phe Leu Arg Lys Met Ala Asp Pro Gin Ser He Gin Glu Ser 
100 105 110 115 

....^^^ ttc ctg gee aac cac aac agg ate ace cag tgt 672 

Gln Asn Leu Ser Met Phe Leu Ala Asn His Asn Arg Ile Thr Gln Cys

120 125 130

ctc cac cag caa ctt gaa gtg atc cca ggc tat gag gag ctg ctg gct 720
Leu His Gln Gln Leu Glu Val Ile Pro Gly Tyr Glu Glu Leu Leu Ala
135 140 145

gac att gtc aac atc tgt gtg gat tac tac gag aac aag atg tac ctg 768
Asp Ile Val Asn Ile Cys Val Asp Tyr Tyr Glu Asn Lys Met Tyr Leu
150 155 160

act ccc agt gag aaa cat atg ctc ctc aag gta aaa ctc ccc 810
Thr Pro Ser Glu Lys His Met Leu Leu Lys Val Lys Leu Pro

165 170 175
tgaggccgca cccatggagc ctgggcttac cctctcacct tcttcttatt aaaaatccgt 870
tttaaaaaac aaaaaaaaaa aaa 893



<210> 114
<211> 1475 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 55. .459 



<221> sig_peptide
<222> 55. .120 
<223> Von Heijne matrix 
score 7.2 

seq GLWLALVDGLVRS/SP 

<221> polyA_signal 
<222> 1444 . .1449 



<221> polyA_site 
<222> 1462..1475



<400> 114 

cagttccgca gctacgtgtg ggacccgctg ctgatcctgt cgcagatcgt cctc atg 57 

Met 

cag acc gtg tat tac ggc teg ctg ggc ctg tgg ctg gcg ctg gtg gac 105 
Gin Thr Val Tyr Tyr Gly Ser Leu Gly Leu Trp Leu Ala Leu Val Asp 

-20 -15 -10 

ggg eta gtg cga age age ccc teg ctg gac cag atg ttc gac gcc gag 153 
Gly Leu Val Arg Ser Ser Pro Ser Leu Asp Gin Met Phe Asp Ala Glu 
-5 1 5 10
ate ctg ggc ttt tec ace cet eca ggc egg etc tee atg atg tee ttc 201 
He Leu Gly Phe Ser Thr Pro Pro Gly Arg Leu Ser Met Met Ser Phe 

15 20 25 

ate ttc aac gee etc acc tgt gee ctg ggc ttg ctg tac ttc ate egg 249 
He Phe Asn Ala Leu Thr Cys Ala Leu Gly Leu Leu Tyr Phe He Arg 

30 35 40 

cga gga aag cag tgt ctg gat ttc act gte act gtc eat ttc ttt cac 297 
Arg Gly Lys Gin Cys Leu Asp Phe Thr Val Thr Val His Phe Phe His 

45 50 55 

ctc ctg ggc tgc tgg ttc tac agc tcc cgt ttc ccc tcg gcg ctg acc 345
Leu Leu Gly Cys Trp Phe Tyr Ser Ser Arg Phe Pro Ser Ala Leu Thr

60 65 70 75

tgg tgg ctg gtc caa gcc gtg tgc att gca ctc atg gct gtc atc ggg 393
Trp Trp Leu Val Gln Ala Val Cys Ile Ala Leu Met Ala Val Ile Gly
80 85 90

gag tac ctg tgc atg cgg acg gag ctc aag gag ata ccc ctc aac tca 441
Glu Tyr Leu Cys Met Arg Thr Glu Leu Lys Glu Ile Pro Leu Asn Ser
95 100 105

gcc cct aaa tcc aat gtc tagaatcagg ccctttggac atcccgctga 489
Ala Pro Lys Ser Asn Val
110

:^:caettgggec ecttaacace ttgggctgct cagaccctce agatgaggte cagcecagat 549 
'•'^tgagaggaa eectggaaat gtgaagtctc tgttggtgtg ggagagatag tgagggcctg 609 
^^^^caaagaagg eaggtageag tcageatgae agetgeaaga atgacetetg tctgttgaag 669 
Mccttggtate tgagaggtca ggaaggggae etetttgagg gtaataacat aattggaaee 729 
j;Btgeeaetct tgageeaeaa tacctgtcac eagectgttg ttttaagaga gaaaaaaaat 789 
Ijjpaaggatatc tgattggagc aaaccacttc tttagtcatc tgtcttacet cectgggaca 849 
gctgttaect ttgeagtgtt gccgaatcae ageagttace tttgeaatgt tgecgaatea 909 
cagcagttet gttggagaaa egettggttt ecggatccag agccacagaa agaaatgtag 969 
gtgtgaagta ttaggctgct gtcagggaga ggatggeaga tggaggcate aageaeaagg 1029 
aaaatgcaca acetgtgcee tgttatacac aegttcatgt gegcecaaga acetatgact 1089 
ttcttccagt tcettetacc aggtccceat cetgetgcea gctetcaaca tagcaggcca 1149 
taggacccag agaagaatcc cagtgttgct caaagtctga ccatcataaa gacactgcct 1209
gtettetagg aatgaceagg cacecagetc ecaetggaet ceaatttttt ttcctgcctt 1269 
atttagaatt ctttggcggg aagggtatga tgggttceca gagacaagaa gcccaacett 1329 
ctggcctggg ctgtgctgat agtgctgagg gagataggaa tttgctgcta agatttttct 1389
ttggggtgga gttteetctg tgaggggett gcagctatee tteetgtgta tacaaataca 1449 
gtattttcca tgaaaaaaaa aaaaaa 1475



<210> 115 
<211> 321 
<212> DNA 
<213> Homo sapiens 






<220> 
<221> CDS 
<222> 48 . .248 



<221> sig_peptide 
<222> 48. .161 
<223> Von Heijne matrix 
score 6.3 

seq LVFALVTAVCCLA/DG 



<221> polyA_signal 
<222> 283 . .288 



<221> polyA_site 
<222> 308. .321 



<400> 115 

gctgagaaga gttgagggaa agtgctgctg ctgggtctgc agacgcg atg aat aac 56 

Met Asn Asn 

gtg cag ccg aaa ata aaa cat cgc ccc ttc tgc ttc agt gtg aaa ggc 104 
Val Gin Pro Lys He Lys His Arg Pro Phe Cys Phe Ser Val Lys Gly 
-35 -30 -25 -20

cac gtg aag atg ctg cgg ctg gtg ttt gca ctt gtg aca gca gta tgc 152
His Val Lys Met Leu Arg Leu Val Phe Ala Leu Val Thr Ala Val Cys
-15 -10 -5

tgt ctt gcc gac ggg gcc ctt att tac cgg aag ctt ctg ttc aat ccc 200
Cys Leu Ala Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro
1 5 10

aac ggt cct tac cag aaa aag cct gtg cat gaa aaa aaa gaa gtt ttg 248
Asn Gly Pro Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu
15 20 25

tgattttata ttacttttta gtttgatact aagtattaaa catatttctg tattcttcca 308
aaaaaaaaaa aaa 321



<210> 116
<211> 450
<212> DNA
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 25. .399 



<221> sig_peptide
<222> 25..186
<223> Von Heijne matrix
score 3.5

seq SILAQVLDQSARA/RL 



<400> 116 

ctgctccagc gctgacgccg agcc atg gcg gac gag gag ctt gag gcg ctg 51
Met Ala Asp Glu Glu Leu Glu Ala Leu
-50

agg aga cag agg ctg gcc gag ctg cag gcc aaa cac ggg gat cct ggt 99
Arg Arg Gln Arg Leu Ala Glu Leu Gln Ala Lys His Gly Asp Pro Gly
-45 -40 -35 -30

gat gcg gcc caa cag gaa gca aag cac agg gaa gca gaa atg aga aac 147
Asp Ala Ala Gln Gln Glu Ala Lys His Arg Glu Ala Glu Met Arg Asn
-25 -20 -15

agt ate tta gcc caa gtt ctg gat cag teg gcc egg gcc agg tta agt 195 
Ser He Leu Ala Gin Val Leu Asp Gin Ser Ala Arg Ala Arg Leu Ser 

-10 -5 1 

aac tta gca ctt gta aag cct gaa aaa act aaa gca gta gag aat tac 243 
Asn Leu Ala Leu Val Lys Pro Glu Lys Thr Lys Ala Val Glu Asn Tyr 

5 10 15 

ctt ata cag atg gca aga tat gga caa eta agt gag aag gta tea gaa 291 
Leu He Gin Met Ala Arg Tyr Gly Gin Leu Ser Glu Lys Val Ser Glu 
20 25 30 35 

caa ggt tta ata gaa ate ctt aaa aaa gta age caa caa aca gaa aag 339 
Gin Gly Leu He Glu He Leu Lys Lys Val Ser Gin Gin Thr Glu Lys 

40 45 50 

aca aca aca gtg aaa ttc aac aga aga aaa gta atg gae tet gat gaa 387 
Thr Thr Thr Val Lys Phe Asn Arg Arg Lys Val Met Asp Ser Asp Glu 

55 60 65 

gat gac gat tat tgaactacaa gtgctcacag actagaactt aacggaacaa 439
Asp Asp Asp Tyr
70

gtctaggaca g 450



<210> 117
<211> 1173
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 10..1137

<221> sig_peptide
<222> 10..72
<223> Von Heijne matrix
score 6.5 

seq LLTLLLPPPPLYT/RH 

<221> polyA_signal 
<222> 1144 . . 1149 

<221> polyA_site 
<222> 1162. .1173 

<400> 117 

gagctgctt atg gga cac cgc ttc ctg cgc ggc etc tta acg ctg ctg ctg 51 
Met Gly His Arg Phe Leu Arg Gly Leu Leu Thr Leu Leu Leu 
-20 -15 -10

ccg ccg cea cec ctg tat acc egg cac cgc atg etc ggt cea gag tec 99 
Pro Pro Pro Pro Leu Tyr Thr Arg His Arg Met Leu Gly Pro Glu Ser 

-5 1 5
gtc ccg ccc cca aaa cga tcc cgc agc aaa ctc atg gca ccg ccc cga 147
Val Pro Pro Pro Lys Arg Ser Arg Ser Lys Leu Met Ala Pro Pro Arg
10 15 20 25 






ate ggg acg cac aat ggc acc ttc cac tgc gac gag gca ctg gca tgc 195 
He Gly Thr His Asn Gly Thr Phe His Cys Asp Glu Ala Leu Ala Cys 

30 35 40 

gca ctg ctt cgc etc ctg ccg gag tac egg gat gca gag att gtg egg 243 
Ala Leu Leu Arg Leu Leu Pro Glu Tyr Arg Asp Ala Glu He Val Arg 

45 50 55 

ace egg gat cce gaa aaa etc get tee tgt gac ate gtg gtg gac gtg 291 
Thr Arg Asp Pro Glu Lys Leu Ala Ser Cys Asp He Val Val Asp Val 

60 65 70 

ggg ggc gag tac gac ect egg aga cac ega tat gac cat cac cag agg 339 
Gly Gly Glu Tyr Asp Pro Arg Arg His Arg Tyr Asp His His Gin Arg 

75 80 85 

tet ttc aca gag ace atg age tec ctg tec ect ggg agg ccg tgg cag 387 
Ser Phe Thr Glu Thr Met Ser Ser Leu Ser Pro Gly Arg Pro Trp Gin 
90 95 100 105

ace aag ctg age agt geg gga etc ate tat ctg cac ttc ggg cac aag 435 
Thr Lys Leu Ser Ser Ala Gly Leu He Tyr Leu His Phe Gly His Lys 

110 115 120 

ctg ctg gee cag ttg ctg ggc act agt gaa gag gac age atg gtg ggc 483 
Leu Leu Ala Gin Leu Leu Gly Thr Ser Glu Glu Asp Ser Met Val Gly 
125 130 135 

acc ctc tat gac aag atg tat gag aac ttt gtg gag gag gtg gat gct 531
Thr Leu Tyr Asp Lys Met Tyr Glu Asn Phe Val Glu Glu Val Asp Ala
140 145 150

gtg gac aat ggg atc tcc cag tgg gca gag ggg gag cct cga tat gca 579
Val Asp Asn Gly Ile Ser Gln Trp Ala Glu Gly Glu Pro Arg Tyr Ala

155 160 165

|;^tg ace act acc ctg agt gca ega gtt get ega ctt aat ect acc tgg 627 
Leu Thr Thr Thr Leu Ser Ala Arg Val Ala Arg Leu Asn Pro Thr Trp
170 175 180 185

aac cac ccc gac caa gac act gag gca ggg ttc aag cgt gca atg gat 675
Asn His Pro Asp Gln Asp Thr Glu Ala Gly Phe Lys Arg Ala Met Asp
190 195 200

:.^tg gtt caa gag gag ttt ctg cag aga tta gat ttc tac caa cac age 723 
Leu Val Gln Glu Glu Phe Leu Gln Arg Leu Asp Phe Tyr Gln His Ser
205 210 215

tgg ctg cca gcc cgg gcc ttg gtg gaa gag gcc ctt gcc cag cga ttc 771
Trp Leu Pro Ala Arg Ala Leu Val Glu Glu Ala Leu Ala Gln Arg Phe
220 225 230 

cag gtg gac eca agt gga gag att gtg gaa ctg geg aaa ggt gca tgt 819 
Gin Val Asp Pro Ser Gly Glu He Val Glu Leu Ala Lys Gly Ala Cys 

235 240 245 

cec tgg aag gag eat etc tac cac ctg gaa tet ggg ctg tee ect eca 867 
Pro Trp Lys Glu His Leu Tyr His Leu Glu Ser Gly Leu Ser Pro Pro 
250 255 260 265 

gtg gee ate ttc ttt gtt ate tac act gac cag get gga cag tgg ega 915 
Val Ala He Phe Phe Val He Tyr Thr Asp Gin Ala Gly Gin Trp Arg 

270 275 280 

ata cag tgt gtg cec aag gag cec cac tea ttc caa age egg ctg cce 963 
He Gin Cys Val Pro Lys Glu Pro His Ser Phe Gin Ser Arg Leu Pro 

285 290 295 

ctg cca gag cca tgg cgg ggt ctt cgg gac gag gcc ctg gac cag gtc 1011
Leu Pro Glu Pro Trp Arg Gly Leu Arg Asp Glu Ala Leu Asp Gin Val 

300 305 310 

agt ggg ate ect ggc tge ate ttc gtc eat gca age ggc ttc att ggc 1059 
Ser Gly He Pro Gly Cys He Phe Val His Ala Ser Gly Phe He Gly 
315 320 325 






ggt cac cgc acc cga gag ggt gcc ttg agc atg gcc cgt gcc acc ttg 1107

Gly His Arg Thr Arg Glu Gly Ala Leu Ser Met Ala Arg Ala Thr Leu 
330 335 340 345 

gcc cag cgc tea tac etc cea eaa ate tec tagtctaata aaacetteca 1157 
Ala Gin Arg Ser Tyr Leu Pro Gin He Ser 
350 355 
tctcaaaaaa aaaaaa 1173 



<210> 118 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 72. .704 

<221> sig_peptide
<222> 72..161
<223> Von Heijne matrix
score 13.2

seq LLLLSTLVIPSAA/AP

<221> polyA_signal
<222> 772..777

<400> 118

cggaatccgg gagtccggtg acccgggctg tggtctagca taaaggcgga gcccagaaga 60
aggggcgggg t atg gga gaa gcc tcc cca cct gcc ccc gca agg cgg cat 110
Met Gly Glu Ala Ser Pro Pro Ala Pro Ala Arg Arg His 
-30 -25 -20 

nptg ctg gtc ctg ctg ctg etc etc tet acc ctg gtg ate ccc tec get 158 
Leu Leu Val Leu Leu Leu Leu Leu Ser Thr Leu Val Ile Pro Ser Ala
-15 -10 -5

gca gct cct atc cat gat gct gac gcc caa gag agc tcc ttg ggt ctc 206
Ala Ala Pro Ile His Asp Ala Asp Ala Gln Glu Ser Ser Leu Gly Leu
1 5 10 15

aca ggc etc cag age eta etc eaa gge ttc age cga ctt ttc ctg aaa 254 
Thr Gly Leu Gin Ser Leu Leu Gin Gly Phe Ser Arg Leu Phe Leu Lys 

20 25 30 

ggt aac ctg ctt egg gge ata gac age tta ttc tct gee ccc atg gac 302 
Gly Asn Leu Leu Arg Gly He Asp Ser Leu Phe Ser Ala Pro Met Asp 

35 40 45 

ttc egg gge etc cet ggg aac tac cac aaa gag gag aac cag gag cac 350 
Phe Arg Gly Leu Pro Gly Asn Tyr His Lys Glu Glu Asn Gin Glu His 

50 55 60 

eag ctg ggg aac aac acc etc tec age cac etc cag ate gac aag gta 398 
Gin Leu Gly Asn Asn Thr Leu Ser Ser His Leu Gin He Asp Lys Val 

65 70 75 

ccc agg atg gag gag aag gag gcc ctg gta ccc ate cag aag gcc aeg 446 
Pro Arg Met Glu Glu Lys Glu Ala Leu Val Pro He Gin Lys Ala Thr 
80 85 90 95 

gac age ttc cac aca gaa etc cat ccc egg gtg gcc ttc tgg ate att 494 
Asp Ser Phe His Thr Glu Leu His Pro Arg Val Ala Phe Trp He He 
100 105 110






aag ctg cca egg egg agg tec cac cag gat gcc ctg gag ggc ggc cac 542 
Lys Leu Pro Arg Arg Arg Ser His Gin Asp Ala Leu Glu Gly Gly His 

115 120 125 

tgg etc age gag aag cga cac cgc ctg cag gcc ate egg gat gga etc 590 
Trp Leu Ser Glu Lys Arg His Arg Leu Gin Ala He Arg Asp Gly Leu 

130 135 140 

cgc aag ggg ace cac aag gae gtc eta gaa gag ggg acc gag age tee 638 
Arg Lys Gly Thr His Lys Asp Val Leu Glu Glu Gly Thr Glu Ser Ser 

145 150 155 

tec cac tec agg ctg tec ccc cga aag acc cac tta ctg tac ate etc 686 
Ser His Ser Arg Leu Ser Pro Arg Lys Thr His Leu Leu Tyr He Leu 
160 165 170 175 

agg ccc tct egg cag ctg taggggtggg gaeeggggag cacctgcctg 734 
Arg Pro Ser Arg Gin Leu 
180 

tagcccccat cagaccctgc cccaagcacc atatggaaat aaagttcttt c 785



<210> 119
<211> 559
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 44..505

<221> sig_peptide
<222> 44..223
<223> Von Heijne matrix
score 4

seq LVRRTLLVAALRA/WM

<400> 119

agcaaccaga gggagatgat cacctgaacc actgctccaa acc atg ggc agt aaa 55

Met Gly Ser Lys
-60
tgc tgt aaa ggt ggt cca gat gaa gat gca gta gaa aga cag agg cgg 103
Cys Cys Lys Gly Gly Pro Asp Glu Asp Ala Val Glu Arg Gln Arg Arg

-55 -50 -45

cag aag ttg ctt ctt gca caa ctg cat cac aga aaa agg gtg aag gca 151 
Gin Lys Leu Leu Leu Ala Gin Leu His His Arg Lys Arg Val Lys Ala 
-40 -35 -30 -25 

get ggg cag ate cag gcc tgg tgg cgt ggg gtc ctg gtg cgc agg acc 199 
Ala Gly Gin He Gin Ala Trp Trp Arg Gly Val Leu Val Arg Arg Thr 

-20 -15 -10 

ctg ctg gtt get gcc etc agg gcc tgg atg att cag tgc tgg tgg agg 247 
Leu Leu Val Ala Ala Leu Arg Ala Trp Met He Gin Cys Trp Trp Arg 

-5 1 5
acg ttg gtg cag aga egg ate cgt cag egg egg cag gcc ctg ttg agg 295 
Thr Leu Val Gin Arg Arg He Arg Gin Arg Arg Gin Ala Leu Leu Arg 

10 15 20 

gtc tac gtc ate cag gag cag gcg acg gtc aag etc cag tec tgc ate 343 
Val Tyr Val He Gin Glu Gin Ala Thr Val Lys Leu Gin Ser Cys He 
25 30 35 40 

cgc atg tgg cag tgc cgg caa tgt tac cgc caa atg tgc aat gct ctc 391
Arg Met Trp Gln Cys Arg Gln Cys Tyr Arg Gln Met Cys Asn Ala Leu

45 50 55

tgc ttg ttc cag gtc cca gag agc agc ctt gcc ttc cag act gat ggc 439
Cys Leu Phe Gln Val Pro Glu Ser Ser Leu Ala Phe Gln Thr Asp Gly

60 65 70

ttt tta cag gtc caa tat gca ate cet tea aag cag cca gag ttc cae 487 
Phe Leu Gin Val Gin Tyr Ala He Pro Ser Lys Gin Pro Glu Phe His 

75 80 85 

att gaa atc cta tca atc tgaaaggcct ggggcatgga gaacaggctg 535
Ile Glu Ile Leu Ser Ile
90

cactacccta ataaatgtct gacc 559



<210> 120 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS
<222> 25..393

<221> sig_peptide
<222> 25..150
<223> Von Heijne matrix
score 4.6

seq LDPAVSLSAPAFA/SA

<221> polyA_signal
<222> 734..739

<221> polyA_site
<222> 757..770

<400> 120

pfigeagaaagg agagacacac atae atg aaa gga gga get ttc tec aat ett 51 

Met Lys Gly Gly Ala Phe Ser Asn Leu 
-40 -35 
aat gat tec cag etc tea gcc teg ttt ctg caa ecc age etg caa gca 99 
Asn Asp Ser Gin Leu Ser Ala Ser Phe Leu Gin Pro Ser Leu Gin Ala 

-30 -25 -20 

aae tgt ect get ttg gae ect get gtg tea etc tee gca cca gee ttt 147 
Asn Cys Pro Ala Leu Asp Pro Ala Val Ser Leu Ser Ala Pro Ala Phe 

-15 -10 -5

gcc tet get ett cgc tet atg aag tec tee cag get gca egg aag gae 195 
Ala Ser Ala Leu Arg Ser Met Lys Ser Ser Gin Ala Ala Arg Lys Asp 

1 5 10 15

gae ttt etc agg tet ctt agt gat gga gae tea ggg aca tea gaa cae 243 
Asp Phe Leu Arg Ser Leu Ser Asp Gly Asp Ser Gly Thr Ser Glu His 

20 25 30 

ate tea gcg gtg gtg act age cet egg att tec tgc eat ggt get gee 291 
He Ser Ala Val Val Thr Ser Pro Arg He Ser Cys His Gly Ala Ala 

35 40 45 

att ccc acc gcc cgt gcc ctc tgc cta ggc tgt tcc tgc tgc acc gaa 339
Ile Pro Thr Ala Arg Ala Leu Cys Leu Gly Cys Ser Cys Cys Thr Glu

50 55 60

cgc ctc ctc ctg cca ccg ccc tcc ctc ctt tct tta gaa gcc cct gcc 387
Arg Leu Leu Leu Pro Pro Pro Ser Leu Leu Ser Leu Glu Ala Pro Ala

65 70 75

age acc tgagctctct gctgattgct gttcctccca gtctgtggaa gctttgccca 443 
Ser Thr 
80 

tatgctttee ttaaaagggt tctgggeagg gcaggcgcce ccatttctea gggatccect 503 
ecaggacaac gccttttcct tgtgtcttca gctctcctta ecagatatet atatatttgt 563 
atatattcag tttcaccaac aatgcatcaa gtactttttt ttttaagtaa agaaccgcag 623 
tcatcgaact ggagccccat tgattccetc cecctegect ccccaaatct ggcacctgce 683 
caaggtatec teagaaccat ttggggtgte ctttggcatt ggataataga aataaaattt 743 
tacctctttc tacaaaaaaa aaaaaac 770 



<210> 121 
<211> 1213 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 58..1095

<221> sig_peptide
<222> 58..114
<223> Von Heijne matrix
score 5.4

seq LSHLLPSLRQVIQ/EP

<221> polyA_site
<222> 1202..1213

<400> 121

cctggctttg cctttgccct gctgtgtgat cttagctccc tgcccaggcc cacagcc 57
atg gcc atg gcc cag aaa ctc agc cac ctc ctg ccg agt ctg cgg cag 105
Met Ala Met Ala Gln Lys Leu Ser His Leu Leu Pro Ser Leu Arg Gln
-15 -10 -5

gtc atc cag gag cct cag cta tct ctg cag cca gag cct gtc ttc acg 153
Val Ile Gln Glu Pro Gln Leu Ser Leu Gln Pro Glu Pro Val Phe Thr

1 5 10
gtg gat ega get gag gtg ccg ccg etc ttc tgg aag ccg tae ate tat 201 
Val Asp Arg Ala Glu Val Pro Pro Leu Phe Trp Lys Pro Tyr He Tyr 

15 20 25 

gcg ggc tac egg ccg ctg cat cag acc tgg cgc ttc tat ttc cgc acg 249 
Ala Gly Tyr Arg Pro Leu His Gin Thr Trp Arg Phe Tyr Phe Arg Thr 
30 35 40 45 

ctg ttc cag cag cac aae gag gee gtg aat gtc tgg acc cac ctg ctg 297 
Leu Phe Gin Gin His Asn Glu Ala Val Asn Val Trp Thr His Leu Leu 

50 55 60 

gcg gee ctg gta ctg ctg ctg egg ctg gee etc ttt gtg gag acc gtg 345 
Ala Ala Leu Val Leu Leu Leu Arg Leu Ala Leu Phe Val Glu Thr Val 

65 70 75 

gac ttc tgg gga gae cca cac gee ctg cce etc ttc ate att gtc ett 393 
Asp Phe Trp Gly Asp Pro His Ala Leu Pro Leu Phe He He Val Leu 
80 85 90 






gcc tct ttc acc tac etc tec etc agt gee ttg get cae etc ctg cag 441 
Ala Ser Phe Thr Tyr Leu Ser Leu Ser Ala Leu Ala His Leu Leu Gin 

95 100 105 

gee aag tct gag tte tgg cat tac age tte ttc ttc ctg gac tat gtg 489 
Ala Lys Ser Glu Phe Trp His Tyr Ser Phe Phe Phe Leu Asp Tyr Val 

115 120 125 

ggg gtg gee gtg tac cag ttt ggc agt gee ttg gea eac tte tae tat 537 
Gly Val Ala Val Tyr Gin Phe Gly Ser Ala Leu Ala His Phe Tyr Tyr 

130 135 140 

get ate gag eec gee tgg eat gee cag gtg cag get gtt ttt ctg ccc 585 
Ala He Glu Pro Ala Trp His Ala Gin Val Gin Ala Val Phe Leu Pro 

145 150 155 

atg get gcc ttt etc gee tgg ctt tec tge att ggc tec tge tat aac 633 
Met Ala Ala Phe Leu Ala Trp Leu Ser Cys He Gly Ser Cys Tyr Asn 

160 165 170 

aag tac ate cag aaa cea ggc ctg ctg ggc egc aca tge cag gag gtg 681 
Lys Tyr lie Gin Lys Pro Gly Leu Leu Gly Arg Thr Cys Gin Glu Val 

175 180 185 

eec tec gtc ctg gee tae gea ctg gac att agt ect gtg gtg cat egt 729 
Pro Ser Val Leu Ala Tyr Ala Leu Asp He Ser Pro Val Val His Arg 
190 195 200 205 

atc ttc gtg tcc tcc gac ccc acc acg gat gat cca gct ctt ctc tac 777
Ile Phe Val Ser Ser Asp Pro Thr Thr Asp Asp Pro Ala Leu Leu Tyr

210 215 220

cac aag tgc cag gtg gtc ttc ttt ctg ctg gct gct gcc ttc ttc tct 825
His Lys Cys Gln Val Val Phe Phe Leu Leu Ala Ala Ala Phe Phe Ser
225 230 235

acc ttc atg ccc gag cgc tgg ttc cct ggc agc tgc cat gtc ttc ggg 873
Thr Phe Met Pro Glu Arg Trp Phe Pro Gly Ser Cys His Val Phe Gly
240 245 250

cag ggc cac caa ctt ttc cat atc ttc ttg gtg ctg tgc acg ctg gct 921
Gln Gly His Gln Leu Phe His Ile Phe Leu Val Leu Cys Thr Leu Ala

255 260 265
cag ctg gag gct gtg gca ctg gac tat gag gcc cga cgg ccc atc tat 969
Gln Leu Glu Ala Val Ala Leu Asp Tyr Glu Ala Arg Arg Pro Ile Tyr
270 275 280 285

gag cct ctg cac acg cac tgg cct cac aac ttt tct ggc ctc ttc ctg 1017
Glu Pro Leu His Thr His Trp Pro His Asn Phe Ser Gly Leu Phe Leu
290 295 300

etc aeg gtg ggc age age ate etc act gea ttc etc ctg age cag ctg 1065 
Leu Thr Val Gly Ser Ser He Leu Thr Ala Phe Leu Leu Ser Gin Leu 

305 310 315 

gta cag egc aaa ctt gat cag aag acc aag tgaaggggga tggeatetgg 1115 
Val Gin Arg Lys Leu Asp Gin Lys Thr Lys 

320 325 
tagggaggga ggtatagttg ggggacaggg gtctgggttt ggctccaagt gggaaeaagg 1175 
gctggtaaag ttgtttgtgt ctggccaaaa aaaaaaaa 1213



<210> 122
<211> 1318
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 31..660

<221> sig_peptide 
<222> 31. .90 
<223> Von Heijne matrix 
score 5.4 

seq AFVIACVLSLIST/IY 

<221> polyA_signal 
<222> 1288. .1293 

<221> polyA_site 
<222> 1307. .1318 

<400> 122 

ggaggatggg cgagcagtct gaatgccaga atg gat aac cgt ttt get aca gca 54 

Met Asp Asn Arg Phe Ala Thr Ala 
-20 -15 

ttt gta att get tgt gtg ctt age etc att tec ace ate tac atg gca 102 

Phe Val He Ala Cys Val Leu Ser Leu He Ser Thr He Tyr Met Ala 
-10 -5 1 

get tec att ggc aca gac tte tgg tat gag tat ega agt cea gtt caa 150 
Ala Ser Ile Gly Thr Asp Phe Trp Tyr Glu Tyr Arg Ser Pro Val Gln
5 10 15 20

HB^a. aat tec agt gat ttg aat aaa age ate tgg gat gaa ttc att agt 198 
Glu Asn Ser Ser Asp Leu Asn Lys Ser Ile Trp Asp Glu Phe Ile Ser
25 30 35

gat gag gca gat gaa aag act tat aat gat gca ctt ttt cga tac aat 246
Asp Glu Ala Asp Glu Lys Thr Tyr Asn Asp Ala Leu Phe Arg Tyr Asn
40 45 50

ggc aca gtg gga ttg tgg aga cgg tgt atc acc ata ccc aaa aac atg 294
Gly Thr Val Gly Leu Trp Arg Arg Cys Ile Thr Ile Pro Lys Asn Met

55 60 65

cat tgg tat agc cca cca gaa agg aca gag tca ttt gat gtg gtc aca 342
His Trp Tyr Ser Pro Pro Glu Arg Thr Glu Ser Phe Asp Val Val Thr
70 75 80

aaa tgt gtg agt ttc aca cta act gag cag ttc atg gag aaa ttt gtt 390
Lys Cys Val Ser Phe Thr Leu Thr Glu Gln Phe Met Glu Lys Phe Val

85 90 95 100

gat ccc gga aac cac aat age ggg att gat etc ctt agg ace tat ctt 438 
Asp Pro Gly Asn His Asn Ser Gly He Asp Leu Leu Arg Thr Tyr Leu 

105 110 115 

tgg egt tgc cag tte ctt tta ect ttt gtg agt tta ggt ttg atg tgc 486 
Trp Arg Cys Gin Phe Leu Leu Pro Phe Val Ser Leu Gly Leu Met Cys 

120 125 130 

ttt ggg get ttg ate gga ctt tgt get tgc att tgc ega age tta tat 534 
Phe Gly Ala Leu He Gly Leu Cys Ala Cys He Cys Arg Ser Leu Tyr 

135 140 145 

ccc acc att gee aeg ggc att etc eat etc ctt gca gtg aca aag gag 582 
Pro Thr He Ala Thr Gly He Leu His Leu Leu Ala Val Thr Lys Glu 

150 155 160 

age atg ctt cea get gga get gag tec aag cac aca gee act ect gca 630 
Ser Met Leu Pro Ala Gly Ala Glu Ser Lys His Thr Ala Thr Pro Ala 

170 175 180 

eac gca tgc gtg caa aca ggg aag ccc aag taggagaaga ggaaagaggt 680 
His Ala Cys Val Gin Thr Gly Lys Pro Lys 
185 190 
tgtagggatt tgggaagaac cttgattatt ccctggagga aaagacaaat ctacttccct 740
gaaatcaccc tcgaatctac ttccaccctc agaacttaaa atgaactgca tccttttttt 800
catcttcttt tcttctccag tgaatatgat ctccaaaccc ttattttttc tttgaactgt 860
aaaatttcca ctcatggacg atgcaaccaa cagatgcaat ctctgagaag atgaaaattg 920
ggacctctta ttataaaatt gacctagctg gactcaggaa accagggaag aagtcaatgc 980
aggcatttaa aatgtaaagt tttttctggt taaatctatt tatttttctt gtaggttgag 1040
tatttcttcc cagtttttct gctctggtgt ataacaaaca ggtcaaaatt tcccatcttt 1100
cctcctgata gtagttgaat cctaccttgc atacttaatg catagtgaaa tggcatctag 1160
cagaaataca cacccccaaa acacaccacc atttcattag gtgcccaaaa aattctgtat 1220
ttagcttatt tatttattgt tatttttgct ttttcttaac ccactatata ttgactgcaa 1280
acgaattaat aaattatccc ttctggaaaa aaaaaaaa 1318



<210> 123 
<211> 853 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31..582

<221> sig_peptide
<222> 31..90
<223> Von Heijne matrix
score 5.4
seq AFVIACVLSLIST/IY

<221> polyA_signal
<222> 816..821

<221> polyA_site
<222> 840..853

<400> 123

ggaggatggg cgagcagtct gaatgccaga atg gat aac cgt ttt gct aca gca 54
Met Asp Asn Arg Phe Ala Thr Ala
-20 -15
ttt gta att gct tgt gtg ctt agc ctc att tcc acc atc tac atg gca 102
Phe Val Ile Ala Cys Val Leu Ser Leu Ile Ser Thr Ile Tyr Met Ala
-10 -5 1

gcc tcc att ggc aca gac ttc tgg tat gaa tat cga agt cca gtt caa 150
Ala Ser Ile Gly Thr Asp Phe Trp Tyr Glu Tyr Arg Ser Pro Val Gln
5 10 15 20

gaa aat tcc agt gat ttg aat aaa agc atc tgg gat gaa ttc att agt 198
Glu Asn Ser Ser Asp Leu Asn Lys Ser Ile Trp Asp Glu Phe Ile Ser
25 30 35

gat gaa gca gat gaa aag act tat aat gat gca cct ttt cga tac aat 246
Asp Glu Ala Asp Glu Lys Thr Tyr Asn Asp Ala Pro Phe Arg Tyr Asn
40 45 50

ggc aca gtg gga ttg tgg aga cgg tgt atc acc ata ccc aaa aac atg 294
Gly Thr Val Gly Leu Trp Arg Arg Cys Ile Thr Ile Pro Lys Asn Met
55 60 65

cat tgg tat agc cca cca gaa agg aca gag tca ttt gat gtg gtc aca 342
His Trp Tyr Ser Pro Pro Glu Arg Thr Glu Ser Phe Asp Val Val Thr
70 75 80

aaa tgt gtg agt ttc aca cta act gag cag ttc atg gag aaa ttt gtt 390
Lys Cys Val Ser Phe Thr Leu Thr Glu Gln Phe Met Glu Lys Phe Val
85 90 95 100

gat ccc gga aac cac aat agc ggg att gat ctc ctt agg acc tat ctt 438
Asp Pro Gly Asn His Asn Ser Gly Ile Asp Leu Leu Arg Thr Tyr Leu
105 110

tgg cgt tgc cag ttc ctt tta cct ttt gtg agt tta ggt ttg atg tgc 486
Trp Arg Cys Gln Phe Leu Leu Pro Phe Val Ser Leu Gly Leu Met Cys
120 125 130

ttt ggg gct ttg atc gga ctt tgt gct tgc att tgc cga agc tta tat 534
Phe Gly Ala Leu Ile Gly Leu Cys Ala Cys Ile Cys Arg Ser Leu Tyr
135 140 145

ccc acc att gcc acg ggc att ctc cat ctc ctt gca gat acc atg ctg 582
Pro Thr Ile Ala Thr Gly Ile Leu His Leu Leu Ala Asp Thr Met Leu
150 155 160

tgaagtccag gccacatgga ggtgtcctgt gtagatgctc cagctgaaat cccaagctaa 642
gctcccaact gacagccaac atcatttcca gccatgtgtg ggagccatcc tggatgtcca 702
gccttaacaa gccttcagag gacttcagcc acagctatta tcttactaca tccttgtgag 762
actctaataa agaaccaact agctgagccc aatcaaccta tggaactgat agaaataaaa 822
tgaattgttg ttttgcgaaa aaaaaaaaaa a 853



<210> 124
<211> 826
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 15..695

<221> sig_peptide
<222> 15..80
<223> Von Heijne matrix
score 8.5
seq AALLLGLMMVVTG/DE

<221> polyA_signal
<222> 795..800

<221> polyA_site
<222> 814..826

<400> 124

aaccagaggt gccc atg ggt tgg aca atg agg ctg gtc aca gca gca ctg 50
Met Gly Trp Thr Met Arg Leu Val Thr Ala Ala Leu
-20 -15

tta ctg ggt ctc atg atg gtg gtc act gga gac gag gat gag aac agc 98
Leu Leu Gly Leu Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser
-10 -5 1 5

ccg tgt gcc cat gag gcc ctc ctg gac gag gac acc ctc ttt tgc cag 146
Pro Cys Ala His Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gln
10 15 20

ggc ctt gaa gtt ttc tac cca gag ttg ggg aac att ggc tgc aag gtt 194
Gly Leu Glu Val Phe Tyr Pro Glu Leu Gly Asn Ile Gly Cys Lys Val
25 30 35

gtt cct gat tgt aac aac tac aga cag aag atc acc tcc tgg atg gag 242
Val Pro Asp Cys Asn Asn Tyr Arg Gln Lys Ile Thr Ser Trp Met Glu
40 45 50

ccg ata gtc aag ttc ccg ggg gcc gtg gac ggc acc tat atc ctg 290
Pro Ile Val Lys Phe Pro Gly Ala Val Asp Gly Thr Tyr Ile Leu
55 60 65 70

atg gtg gat cca gat gcc cct agc aga gca gaa ccc aga cag aga 338
Val Met Val Asp Pro Asp Ala Pro Ser Arg Ala Glu Pro Arg Gln Arg
75 80 85

ttc tgg aga cat tgg ctg gta aca gat atc aag ggc gcc gac ctg aag 386
Phe Trp Arg His Trp Leu Val Thr Asp Ile Lys Gly Ala Asp Leu Lys
90 95 100

aaa ggg aag att cag ggc cag gag tta tca gcc tac cag gct ccc tcc 434
Lys Gly Lys Ile Gln Gly Gln Glu Leu Ser Ala Tyr Gln Ala Pro Ser
105 110 115

cca ccg gca cac agt ggc ttc cat cgc tac cag ttc ttt gtc tat ctt 482
Pro Pro Ala His Ser Gly Phe His Arg Tyr Gln Phe Phe Val Tyr Leu
120 125 130

cag gaa gga aag gtc atc tct ctc ctt ccc aag gaa aac aaa act cga 530
Gln Glu Gly Lys Val Ile Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg
135 140 145 150

tct tgg aaa atg gac aga ttt ctg aac cgt ttc cac ctg ggc gaa 578
Gly Ser Trp Lys Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu
155 160 165

cct gaa gca agc acc cag ttc atg acc cag aac tac cag gac tca cca 626
Pro Glu Ala Ser Thr Gln Phe Met Thr Gln Asn Tyr Gln Asp Ser Pro
170 175 180

acc ctc cag gct ccc aga gaa agg gcc agc gag ccc aag cac aaa aac 674
Thr Leu Gln Ala Pro Arg Glu Arg Ala Ser Glu Pro Lys His Lys Asn
185 190 195

cag gcg gag ata gct gcc tgc tagatagccg gctttgccat ccgggcatgt 725
Gln Ala Glu Ile Ala Ala Cys
200 205

gccacactg cccaccaccg acgatgtggg tatggaaccc cctctggata cagaacccct 785
tcttttccaa ataaaaaaaa aatcatccaa aaaaaaaaaa a 826



<210> 125
<211> 571 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74..295

<221> sig_peptide
<222> 74..196
<223> Von Heijne matrix
score 5.4
seq RLLYIGFLGYCSG/LI

<221> polyA_signal
<222> 545..550

<221> polyA_site
<222> 561..571

<400> 125

cgggtagtgg tcgtcgtggt tttccttgta gttcgtggtc tgagaccagg cctcaagtgg 60 
aaacggcgtc acc atg atc gca cgg cgg aac cca gta ccc tta cgg ttt 109
Met Ile Ala Arg Arg Asn Pro Val Pro Leu Arg Phe
-40 -35 -30

ctg ccg gat gag gcc cgg agc ctg ccc ccg ccc aag ctg acc gac ccg 157
Leu Pro Asp Glu Ala Arg Ser Leu Pro Pro Pro Lys Leu Thr Asp Pro
-25 -20 -15

cgg ctc ctc tac atc ggc ttc ttg ggc tac tgc tcc ggc ctg att gat 205
Arg Leu Leu Tyr Ile Gly Phe Leu Gly Tyr Cys Ser Gly Leu Ile Asp
-10 -5 1

aac ctg atc cgg cgg agg ccg atc gcg acg gct ggt ttg cat cgc cag 253
Asn Leu Ile Arg Arg Arg Pro Ile Ala Thr Ala Gly Leu His Arg Gln
5 10 15

ctt cta tat att acg gcc ttt ttt ttg ctg gat att atc ttg 295
Leu Leu Tyr Ile Thr Ala Phe Phe Leu Leu Asp Ile Ile Leu
20 25 30

taaaacgtga agactacctg tatgctgtga gggaccgtga aatgtttgga tatatgaaat 355
tacatccaga ggattttcct gaagaagata agaaaacata tggtgaaatt tttgaaaaat 415
tccatccaat acgttgaagt cttcaaaatg cttgctccag tttcactgat acctgctgtt 475
cctgaatttg atggaacatg tttcttatga cagttgaagc ttatgctaat ctgtatgttg 535
caccttgta attaaaatac gtaccaaaaa aaaaaa 571



<210> 126
<211> 659
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 440..658

<221> polyA_signal
<222> 601..606

<400> 126

cgccttacga gctgggaggt ggtgcctctc acccagctaa ttgctctcta gcccttggcc 60
ttcacaggtg ttggtgcctg ccgtgaacgc attctgacct gggccgtatc tgtctcccaa 120
gactttgtgc ctatggttgg ggacagagtg aggtcgttgc cttgacgacg acagcatgcg 180
gcccgtggtc ctcctaagtg tgagcttgcg gcggaccgag gcccacctgc ctccctgcct 240
gcttcgccca ggactcgtga ctgcgtccgc agaagaaatc acaacagcgc tggaattgct 300
agtttgctag gcagcatctt ttggacctgc gaaccatatg catttcacct caaatctgtt 360
tccaagttga aaacctttgg gtctttctat gcgaacggat tgaagaaacg caaaaagttt 420
ctacggactt taaattaaa atg gaa aaa tat gaa aac ctg ggt ttg gtt gga 472
Met Glu Lys Tyr Glu Asn Leu Gly Leu Val Gly
1 5 10
gaa ggg agt tat gga atg gtg atg aag tgt agg aat aaa gat act gga 520
Glu Gly Ser Tyr Gly Met Val Met Lys Cys Arg Asn Lys Asp Thr Gly
15 20 25

aga att gtg gcc ata aag aag ttc tta gaa agt gac gat gac aaa atg 568
Arg Ile Val Ala Ile Lys Lys Phe Leu Glu Ser Asp Asp Asp Lys Met
30 35 40

gtt aaa aag att gca atg cga gaa gtc aag tta cta aag caa ctt agg 616
Val Lys Lys Ile Ala Met Arg Glu Val Lys Leu Leu Lys Gln Leu Arg
45 50 55

cat gaa aac ttg gtg aat ctc ttg gaa gtg tgt aaa aaa aaa a
His Glu Asn Leu Val Asn Leu Leu Glu Val Cys Lys Lys Lys
60 65 70



<210> 127 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 38..283

<221> sig_peptide
<222> 38..85
<223> Von Heijne matrix
score 4.1
seq LLPATSLAGPVLS/TL

<221> polyA_signal
<222> 257..262

<400> 127

acctgaatc ccaggaaccc tcaatgaggt cttcaag atg aag aga ctg ctg cca 55
Met Lys Arg Leu Leu Pro
-15

gct acc agc ctg gct ggc cct gtc ctg tcc acc ctc att gcc cca act 103
Ala Thr Ser Leu Ala Gly Pro Val Leu Ser Thr Leu Ile Ala Pro Thr
-10 -5 1 5

v^ac atg ttg ttt tgt gaa gat aaa agc tgg gat ctt ttt ctt ttt ttt 151
Pro Met Leu Phe Cys Glu Asp Lys Ser Trp Asp Leu Phe Leu Phe Phe
10 15 20

aag tct cac aag aca tgg ggc atc tcc aca aat tta agt tcc tgt cca 199
Lys Ser His Lys Thr Trp Gly Ile Ser Thr Asn Leu Ser Ser Cys Pro
25 30 35

ttt gga aat ttg ttt cta tgt gta cag ttt gtc aga gaa aaa caa agt 247
Phe Gly Asn Leu Phe Leu Cys Val Gln Phe Val Arg Glu Lys Gln Ser
40 45 50

ttt tgt atg aat aca gaa tgt gat tta cgc aag aat tgacaaaaaa 293
Phe Cys Met Asn Thr Glu Cys Asp Leu Arg Lys Asn
55 60 65

aaaaaaaa



<210> 128 
<211> 477 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 121..477

<221> sig_peptide
<222> 121..288
<223> Von Heijne matrix
score 3.5
seq SSCADSFVSSSSS/QP

<400> 128

cctcggagca ggcggagtaa agggacttga gcgagccagt tgccggatta ttctatttcc 60 
cctccctctc tcccgccccg tatctctttt cacccttctc ccaccctcgc tcgcgtagcc 120 
atg gcg gag ccg tcg gcg gcc act cag tcc cat tcc atc tcc tcg tcg 168
Met Ala Glu Pro Ser Ala Ala Thr Gln Ser His Ser Ile Ser Ser Ser
-55 -50 -45

tcc ttc gga gcc gag ccg tcc gcg ccc ggc ggc ggc ggg agc cca gga 216
Ser Phe Gly Ala Glu Pro Ser Ala Pro Gly Gly Gly Gly Ser Pro Gly
-40 -35 -30 -25

gcc tgc ccc gcc ctg ggg acg aag agc tgc agc tcc tcc tgt gcg gat 264
Ala Cys Pro Ala Leu Gly Thr Lys Ser Cys Ser Ser Ser Cys Ala Asp
-20 -15 -10

tcc ttt gtt tct tcc tct tcc tct cag cct gta tct cta ttt tcg acc 312
Ser Phe Val Ser Ser Ser Ser Ser Gln Pro Val Ser Leu Phe Ser Thr

tca caa gag gga ttg agc tct ctt tgc tct gat gag cca tct tca gaa 360
Ser Gln Glu Gly Leu Ser Ser Leu Cys Ser Asp Glu Pro Ser Ser Glu
10 15 20

att atg act tct tcc ttt ctt tca tct tct gaa ata cat aac act ggc 408
Ile Met Thr Ser Ser Phe Leu Ser Ser Ser Glu Ile His Asn Thr Gly
25 30 35 40

ctt aca ata cta cat gga gaa aaa agc cat gtg tta ggg agc cag cct 456
Leu Thr Ile Leu His Gly Glu Lys Ser His Val Leu Gly Ser Gln Pro
45 50 55

att tta gcc aaa aaa aaa aaa 477
Ile Leu Ala Lys Lys Lys Lys
60



<210> 129
<211> 323
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..163

<221> polyA_signal
<222> 292..297

<221> polyA_site
<222> 310..323

<400> 129

a gct ttc gtg tgg gag cca gct atg gtg cgg atc aat gcg ctg aca gca 49
Ala Phe Val Trp Glu Pro Ala Met Val Arg Ile Asn Ala Leu Thr Ala
1 5 10 15

gcc tct gag gct gcg tgc ctg atc gtg tct gta gat gaa acc atc aag 97
Ala Ser Glu Ala Ala Cys Leu Ile Val Ser Val Asp Glu Thr Ile Lys
20 25 30

aac ccc cgc tcg act gtg gat gct ccc aca gca gca ggc cgg ggc cgt 145
Asn Pro Arg Ser Thr Val Asp Ala Pro Thr Ala Ala Gly Arg Gly Arg
35 40 45

ggt cgt ggc cgc ccc cac tgagaggcac cccacccatc acatggctgg 193
Gly Arg Gly Arg Pro His
50

ctggctgctg ggtgcactta ccctccttgg cttggttact tcattttaca aggaaggggt 253
agtaattggc ccactctctt cttactggag gctatttaaa taaaatgtaa gacttcaaaa 313
aaaaaaaaaa 323



<210> 130 

<211> 1392 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS
<222> 46..675

<221> sig_peptide
<222> 46..87
<223> Von Heijne matrix
score 5.3
seq LTLLGLSFILAGL/IV

<221> polyA_signal
<222> 1364..1369

<221> polyA_site
<222> 1383..1392

<400> 130

tccgagttg ccacccagga aaaagagggc tcctctggga gatgt atg ctt act ctc 57
Met Leu Thr Leu

tta ggc ctt tca ttc atc ttg gca gga ctt att gtt ggt gga gcc tgc 105
Leu Gly Leu Ser Phe Ile Leu Ala Gly Leu Ile Val Gly Gly Ala Cys
-10 -5 1 5

att tac aag tac ttc atg ccc aag agc acc att tac cgt gga gag atg 153
Ile Tyr Lys Tyr Phe Met Pro Lys Ser Thr Ile Tyr Arg Gly Glu Met
10 15 20

tgc ttt ttt gat tct gag gat cct gca aat tcc ctt cgt gga gga gag 201
Cys Phe Phe Asp Ser Glu Asp Pro Ala Asn Ser Leu Arg Gly Gly Glu
25 30 35

cct aac ttc ctg cct gtg act gag gag gct gac att cgt gag gat gac 249
Pro Asn Phe Leu Pro Val Thr Glu Glu Ala Asp Ile Arg Glu Asp Asp
40 45 50

aac att gca atc att gat gtg cct gtc ccc agt ttc tct gat agt gac 297
Asn Ile Ala Ile Ile Asp Val Pro Val Pro Ser Phe Ser Asp Ser Asp
55 60 65 70

cct gca gca att att cat gac ttt gaa aag gga atg act gct tac ctg 345
Pro Ala Ala Ile Ile His Asp Phe Glu Lys Gly Met Thr Ala Tyr Leu
75 80 85

gac ttg ttg ctg ggg atc tgc tat ctg atg ccc ctc aat act tct att 393
Asp Leu Leu Leu Gly Ile Cys Tyr Leu Met Pro Leu Asn Thr Ser Ile
90 95 100

gtt atg cct cca aaa aat ctg gta gag ctc ttt ggc aaa ctg gcg agt 441
Val Met Pro Pro Lys Asn Leu Val Glu Leu Phe Gly Lys Leu Ala Ser
105 110 115

ggc aga tat ctg cct caa act tat gtg gtt cga gaa gac cta gtt gct 489
Gly Arg Tyr Leu Pro Gln Thr Tyr Val Val Arg Glu Asp Leu Val Ala
120 125 130

gtg gag gaa att cgt gat gtt agt aac ctt ggc atc ttt att tac caa 537
Val Glu Glu Ile Arg Asp Val Ser Asn Leu Gly Ile Phe Ile Tyr Gln
140 145 150

ctt tgc aat aac aga aag tcc ttc cgc ctt cgt cgc aga gac ctc ttg 585
Leu Cys Asn Asn Arg Lys Ser Phe Arg Leu Arg Arg Arg Asp Leu Leu
155 160 165

ctg ggt ttc aac aaa cgt gcc att gat aaa tgc tgg aag att aga cac 633
Leu Gly Phe Asn Lys Arg Ala Ile Asp Lys Cys Trp Lys Ile Arg His
170 175 180

ttc ccc aac gaa ttt att gtt gag acc aag atc tgt caa gag 675
Phe Pro Asn Glu Phe Ile Val Glu Thr Lys Ile Cys Gln Glu
185 190 195

taagaggcaa cagatagagt gtccttggta ataagaagtc agagatttac aatatgactt 735
taacattaag gtttatggga tactcaagat atttactcat gcatttactc tattgcttat 795
ctttaaaaa aaggaaaaaa aaaaaactac taaccactgc aagctcttgt caaattttag 855
tttaattggc attgcttgtt ttttgaaact gaaattacat gagtttcatt ttttctttgc 915
tttataggg tttagatttc tgaaagcagc atgaatatat cacctaacat cctgacaata 975
aattccatcc gttgtttttt ttgtttgttt gttttttctt ttcctttaag taagctcttt 1035
ttcatctta tggtggagca attttaaaat ttgaaatatt ttaaattgtt tttgaacttt 1095
tgtgtaaaa tatatcagat ctcaacattg ttggtttctt ttgtttttca ttttgtacaa 1155
tttcttgaa tttagaaatt acatctttgc agttctgtta ggtgctctgt aattaacctg 1215
cttatatgt gaacaatttt catgagacag tcatttttaa ctaatgcagt gattctttct 1275
cactactatc tgtattgtgg aatgcacaaa attgtgtagg tgctgaatgc tgtaaggagt 1335
ttaggttgta tgaattctac aaccctataa taaattttac tctatacaaa aaaaaaa 1392



<210> 131
<211> 999
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 62..385

<221> polyA_signal
<222> 974..979

<221> polyA_site
<222> 987..999

<400> 131



cctgaatgac ttgaatgttt ccccgcctga gctaacagtc catgtgggtg attcagctct
g atg gga tgt gtt ttc cag agc aca gaa gac aaa tgt ata ttc aag ata
Met Gly Cys Val Phe Gln Ser Thr Glu Asp Lys Cys Ile Phe Lys Ile
1 5 10 15

gac tgg act ctg tca cca gga gag cac gcc aag gac gaa tat gtg cta
Asp Trp Thr Leu Ser Pro Gly Glu His Ala Lys Asp Glu Tyr Val Leu
20 30

tcLC tat tac t CO a a "t" /-I -f- /-I dy U gtg cct •I 4- i- acc ggg cgc ttc cag aac cgc 205
Tyr Tyr Tyr Ser u. O ex Val Pro X j.e Gly Arg Phe Gln Asn Arg
35

gta cac tta atg ggg gac atc tgc aat gat ggc tcc ctc ctg ctc 253
Val His Leu Met Gly Asp Ile Leu Cys Asn Asp Gly Ser Leu Leu Leu
50 60

caa gat gtg caa gag y ^ y ctv_ cag gga acc tat atc tgt gaa atc cgc 301
Gln Asp Val Gln Glu Ala Asp Gln Gly Thr Tyr Ile Cys Glu Ile Arg
65 70 75 80

ctc aaa ggg gag agc cag gtg ttc aag aag gcg gtg gta ctg cat gtg 349
Leu Lys Gly Glu Ser Gln Val Phe Lys Lys Ala Val Val Leu His Val
85 90 95

ctt cca gag gag ccc aaa ggt acg caa atg ctt act taaagagggg 395
Leu Pro Glu Glu Pro Lys Gly Thr Gln Met Leu Thr
100 105

ccaaggggca agagctttca tgtgcaagag gcaaggaaac tgattatctt gagtaaatgc 455 

cagcctttgg gctaagtact taccacagag tgaatcttca aaaaatgatc ataattattt 515 

cagtcaataa aaatagagtt attttattaa ataaaatatt gataattatt gtattattac 575

tttaaacaca cttccccctc acaaaagccc tgtgaaggat gttttgttca catatatgtc 635

caaatatgtt ttggacacat atttattaaa tggaataaat agtacttgaa ccctggcacc 695

tctgacaaca aagtccatgt tctttttact atgccctaat acctttcatc agttatccac 755

attgatgcta catctgtatt ttataggtac cctatgttag gtgttctggg ggatagaaaa 815

gaaataagca ggccaggctc agtggctcat gcctgtaatc ctagcatttt gggaggctga 875

ggcagcagaa ctgcctgagc cccagggttc aagactgcag tgagctatga tggcaccact 935

gcattctagc ctgggtgaca gagcaagact ctgtctaaaa taaaaaaaga gaaaaaaaaa 995

aaaa 999



<210> 132
<211> 725
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 422..550

<221> sig_peptide
<222> 422..475
<223> Von Heijne matrix
score 4.5
seq LRWLMPVIPALWG/AE

<221> polyA_site
<222> 714..725

<400> 132

tctgcgaggg tgggagagaa aattaggggg agaaaggaca gagagagcaa ctaccatcca 60

tagccagata ggtgagtaaa tatatttgca gtaacctatt tgctattcct tgctgcaact 120

gtgtttaatg ttccttccag aatcagagag agtattgcca tccaagaaat cgtttttaga 180

tatgacattt gagctatcat cttgagacca atacctaaaa caatttcagt ttaagaaatg 240

tctaggtatg gtgaaaacac agtttaaaac cagcaaaaca gaatttattg ccctcagcga 300

atacccacaa tgtacatata ccttgtattt ctgaaagcaa agcaagcatg ccaagtagtt 360

tttatttacc tgtacctata atacagcaag gtgaaacagg atatattttt gaagtttaaa 420

a atg tct tca ggc cgg ctg cgg tgg ctc atg cct gta atc cca gca ctt 469
Met Ser Ser Gly Arg Leu Arg Trp Leu Met Pro Val Ile Pro Ala Leu
-15 -10 -5

tgg gga gcc gag aag ggt gaa tca cct gag gtc agc agt ttt gag acc 517
Trp Gly Ala Glu Lys Gly Glu Ser Pro Glu Val Ser Ser Phe Glu Thr
1 5 10

agg ctg gcc aac atg gcg aaa ccc tgt ctc tac tgaaaataca aaaattagct 570
Arg Leu Ala Asn Met Ala Lys Pro Cys Leu Tyr
15 20 25

gggtgtggtg gcgggcgcct gtagtcccag ctacttggga gactgaggca ggagaattgc 630
ttgaacacgg aaggcggaag ttgcagtaag ctgagatcgt gccaccgcac accagcttgg 690
gcaacagagt gagactccct ctcaaaaaaa aaaaa 725



<210> 133 
<211> 400 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 124..231

<221> polyA_site
<222> 387..400

<400> 133

tcgcctctc ctggcttctg gtatgcacca gcaattcctg gcgttccttg gctcctagaa 60
catcactcc tatcacatgg tcatcttcac cctgtgtgtc ttcacactac cctttctctg 120
gc atg tct gcc cga atc cct ttt tat aag gac acc agt cag att aga 168
Met Ser Ala Arg Ile Pro Phe Tyr Lys Asp Thr Ser Gln Ile Arg
1 5 10 15

r^lita ggg tct acc ata ata cct cat ttt aac tta atc acc ttt gta aag 216
Asn Gly Ser Thr Ile Ile Pro His Phe Asn Leu Ile Thr Phe Val Lys
20 25 30

acc ttt ttc caa ata tagtcactct ctgaggtact gatggttagg atctcaacat 271
Thr Phe Phe Gln Ile
35

accttttttg ggaggacaca attgaaccca taacagggtg tttgcaagga agagttaaaa 331
tttgaaagaa aggtggtatt tgcttagata gatagggcac agctttctag gtgacaaaaa 391
aaaaaaaaa 400



<210> 134 

<211> 1053 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 131..1051

<221> sig_peptide
<222> 131..169
<223> Von Heijne matrix
score 4.2
seq MLAVSLTVPLLGA/MM

<221> polyA_signal
<222> 1019..1024



<400> 134 

gagcgaggcg gacgggctgc gacagcgccg gcccctgcgg ccgcaggtcg tcacagacga 60 
tgatggccag gccccggagg ctaaggacgg cagctccttt agcggcagag ttttccgagt 120 
gaccttcttg atg ctg gct gtt tct ctc acc gtt ccc ctg ctt gga gcc 169
Met Leu Ala Val Ser Leu Thr Val Pro Leu Leu Gly Ala
-10 -5
atg atg ctg ctg gaa tct cct ata gat cca cag cct ctc agc ttc aaa 217
Met Met Leu Leu Glu Ser Pro Ile Asp Pro Gln Pro Leu Ser Phe Lys
1 5 10 15

gaa ccc ccg ctc ttg ctt ggt gtt ctg cat cca aat acg aag ctg cga 265
Glu Pro Pro Leu Leu Leu Gly Val Leu His Pro Asn Thr Lys Leu Arg
20 25 30

cag gca gaa agg ctg ttt gaa aat caa ctt gtt gga ccg gag tcc ata 313
Gln Ala Glu Arg Leu Phe Glu Asn Gln Leu Val Gly Pro Glu Ser Ile
35 40 45

gca cat att ggg gat gtg atg ttt act ggg aca gca gat ggc cgg gtc 361
Ala His Ile Gly Asp Val Met Phe Thr Gly Thr Ala Asp Gly Arg Val
50 55 60

gta aaa ctt gaa aat ggt gaa ata gag acc att gcc cgg ttt ggt tcg 409
Val Lys Leu Glu Asn Gly Glu Ile Glu Thr Ile Ala Arg Phe Gly Ser
65 70 75 80

ggc cct tgc aaa acc cga gat gat gag cct gtg tgt ggg aga ccc ctg 457
Gly Pro Cys Lys Thr Arg Asp Asp Glu Pro Val Cys Gly Arg Pro Leu
85 90 95

ggt atc cgt gca ggg ccc aat ggg act ctc ttt gtg gcc gat gca tgc 505
Gly Ile Arg Ala Gly Pro Asn Gly Thr Leu Phe Val Ala Asp Ala Cys
100 105 110

aag gga cta ttt gaa gta aat ccc tgg aaa cgt gaa gtg aaa ctg ctg 553
Lys Gly Leu Phe Glu Val Asn Pro Trp Lys Arg Glu Val Lys Leu Leu
115 120 125

-ptg tcc tcc gag aca ccc att gag ggg aag aac atg tcc ttt gtg aat 601
Leu Ser Ser Glu Thr Pro Ile Glu Gly Lys Asn Met Ser Phe Val Asn
130 135 140

gat ctt aca gtc tct cag gat ggg agg aag att tat ttc acc gat tct 649
Asp Leu Thr Val Ser Gln Asp Gly Arg Lys Ile Tyr Phe Thr Asp Ser
145 150 155 160

agc agc aaa tgg caa aga cga gac tac ctg ctt ctg gtg atg gag ggc 697
Ser Ser Lys Trp Gln Arg Arg Asp Tyr Leu Leu Leu Val Met Glu Gly
165 170 175

aca gat gac ggg cgc ctg ctg gag tat gat act gtg acc agg gaa gta 745
Thr Asp Asp Gly Arg Leu Leu Glu Tyr Asp Thr Val Thr Arg Glu Val
180 185 190

aaa gtt tta ttg gac cag ctg cgg ttc ccg aat gga gtc cag ctg tct 793
Lys Val Leu Leu Asp Gln Leu Arg Phe Pro Asn Gly Val Gln Leu Ser
195 200 205

cct gca gaa gac ttt gtc ctg gtg gca gaa aca acc atg gcc agg ata 841
Pro Ala Glu Asp Phe Val Leu Val Ala Glu Thr Thr Met Ala Arg Ile
210 215 220

cga aga gtc tac gtt tct ggc ctg atg aag ggc ggg gct gat ctg ttt 889
Arg Arg Val Tyr Val Ser Gly Leu Met Lys Gly Gly Ala Asp Leu Phe
225 230 235 240

gtg gag aac atg cct gga ttt cca gac aac atc cgg ccc agc agc tct 937
Val Glu Asn Met Pro Gly Phe Pro Asp Asn Ile Arg Pro Ser Ser Ser
245 250 255

ggg ggg tac tgg gtg ggc atg tcg acc atc cgc cct aac cct ggg ttt 985
Gly Gly Tyr Trp Val Gly Met Ser Thr Ile Arg Pro Asn Pro Gly Phe

260 265 270 

tcc atg ctg gat ttc tta tct gag aga ccc tgg att aaa agg atg att 1033
Ser Met Leu Asp Phe Leu Ser Glu Arg Pro Trp Ile Lys Arg Met Ile

275 280 285 

ttt aag gca aaa aaa aaa aa 1053 
Phe Lys Ala Lys Lys Lys 
290 



<210> 135 

<211> 1128 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS
<222> 86..403

<221> sig_peptide
<222> 86..181
<223> Von Heijne matrix
score 8.8
seq VPMLLLIVGGSFG/LR

<221> polyA_signal
<222> 1097..1102

<221> polyA_site
<222> 1117..1128

<400> 135

gtcttggtg agagcgtgag ctgctgagat ttgggagtct gcgctaggcc cgcttggagt 60
tctgagccga tggaagagtt cactc atg ttt gca ccc gcg gtg atg cgt gct 112
Met Phe Ala Pro Ala Val Met Arg Ala
-30 -25
ttt cgc aag aac aag act ctc ggc tat gga gtc ccc atg ttg ttg ctg 160
Phe Arg Lys Asn Lys Thr Leu Gly Tyr Gly Val Pro Met Leu Leu Leu
-20 -15 -10

att gtt gga ggt tct ttt ggt ctt cgt gag ttt tct caa atc cga tat 208
Ile Val Gly Gly Ser Phe Gly Leu Arg Glu Phe Ser Gln Ile Arg Tyr
-5 1 5
gat gct gtg aag agt aaa atg gat cct gag ctt gaa aaa aaa ctg aaa 256
Asp Ala Val Lys Ser Lys Met Asp Pro Glu Leu Glu Lys Lys Leu Lys
10 15 20 25

gag aat aaa ata tct tta gag tcg gaa tat gag aaa atc aaa gac tcc 304
Glu Asn Lys Ile Ser Leu Glu Ser Glu Tyr Glu Lys Ile Lys Asp Ser
30 35 40

aag ttt gat gac tgg aag aat att cga gga ccc agg cct tgg gaa gat 352
Lys Phe Asp Asp Trp Lys Asn Ile Arg Gly Pro Arg Pro Trp Glu Asp
45 50 55

cct gac ctc ctc caa gga aga aat cca gaa agc ctt aag act aag aca 400
Pro Asp Leu Leu Gln Gly Arg Asn Pro Glu Ser Leu Lys Thr Lys Thr
60 65 70

act tgactctgct gattcttttt tccnnntttt ttttttttta aataaaaata 453 
Thr 

ctattaactg gacttcctaa tatatacttc tatcaagtgg aaaggaaatt ccaggcccat 513 

ggaaacttgg atatgggtaa tttgatgaca aataatcttc actaaaggtc atgtacaggt 573 

ttttatactt cccagctatt ccatctgtgg atgaaagtaa caatgttggc cacgtatatt 633 

ttacacctcg aaataaaaaa tgtgaatact gctccaaaaa aaaaaaccag taccgtgtag 693 

tctctctcgt ggcttggatt tacactgggc aacgtggttg gaatgtatct ggctcagaac 753 

tatgatatac caaacctggc taaaaaactt gaagaaatta aaaaggactt ggatgccaag 813 

aagaaacccc ctagtgcatg agactgcctc cagcactgcc ttcaggatat accgattcta 873 

ctgctcttga gggcctcgtt tactatctga accaaaagct tttgttttcg tctccagcct 933 

cagcacttct cttctttgct agaccctgtg ttttttgctt taaagcaagc aaaatggggc 993 

cccaatttga gaactacccg acgtttccaa catactcacc tcttcccata atccctttcc 1053 

aactgcatgg gaggttctaa gactggaatt atggtgctag attagtaaac atgactttta 1113 

acgaaaaaaa aaaaa 1128 



<210> 136 

<211> 254 

<212> DNA 

<213> Homo sapiens

<220>
<221> CDS
<222> 37..162

<221> sig_peptide
<222> 37..93
<223> Von Heijne matrix
score 9.5
seq LMCLSLCTAFALS/KP

<221> polyA_signal
<222> 224..229

<221> polyA_site
<222> 243..254

<400> 136 

tgtgctgtgg gggctacgag gaaagatcta attatc atg gac ctg cga cag ttt 54 

Met Asp Leu Arg Gin Phe 
-15 

ctt atg tgc ctg tcc ctg tgc aca gcc ttt gcc ttg agc aaa ccc aca 102
Leu Met Cys Leu Ser Leu Cys Thr Ala Phe Ala Leu Ser Lys Pro Thr
-10 -5 1

gaa aag aag gac cgt gta cat cat gag cct cag ctc agt gac aag gtt 150
Glu Lys Lys Asp Arg Val His His Glu Pro Gln Leu Ser Asp Lys Val
5 10 15

cac aat gat att tgatagaacc aattgttgta cataaaacag atctgcgcat 202
His Asn Asp Ile
20 

atatatatat gtataaaaaa taataaaata atggaagatg aaaaaaaaaa aa 254 



<210> 137 
<211> 886
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 31..381

<221> sig_peptide
<222> 31..90
<223> Von Heijne matrix
score 5.4
seq AFVIACVLSLIST/IY

<221> polyA_site
<222> 875..886

<400> 137 

ggaggatggg cgagcagtct gaatggcaga atg gat aac cgt ttt gct aca gca 54
Met Asp Asn Arg Phe Ala Thr Ala
-20 -15
ttt gta att gct tgt gtg ctt agc ctc att tcc acc atc tac atg gca 102
Phe Val Ile Ala Cys Val Leu Ser Leu Ile Ser Thr Ile Tyr Met Ala

gcc tcc att ggc aca gac ttc tgg tat gaa tat cga agt cca gtt caa 150
Ala Ser Ile Gly Thr Asp Phe Trp Tyr Glu Tyr Arg Ser Pro Val Gln
5 10 15 20

gaa aat tcc agt gat ttg aat aaa agc atc tgg gat gaa ttc att agt 198
Glu Asn Ser Ser Asp Leu Asn Lys Ser Ile Trp Asp Glu Phe Ile Ser
25 30 35

gat gag gca gat gaa aag act tat aat gat gca ctt ttt cga tac aat 246
Asp Glu Ala Asp Glu Lys Thr Tyr Asn Asp Ala Leu Phe Arg Tyr Asn
40 45 50

ggc aca gtg gga ttg tgg gga cgg tgt atc acc ata ccc aaa aac atg 294
Gly Thr Val Gly Leu Trp Gly Arg Cys Ile Thr Ile Pro Lys Asn Met
55 60 65

cat tgg tat agc cca cca gaa agg aca ggt att tct ctt att tta act 342
His Trp Tyr Ser Pro Pro Glu Arg Thr Gly Ile Ser Leu Ile Leu Thr
70 75 80

tct gtc ttc ttc acc tgg tta ata ata gac aaa acg acg taatgattgc 391
Ser Val Phe Phe Thr Trp Leu Ile Ile Asp Lys Thr Thr
85 90 95

ccaattacat gtaagcaggt ttgttggttc tctctctcct taaagaaata aatcgtgtat 451
cttctctttc tactgccttc tctccccaac ttctttgcat taccatggta ctcatcaata 511
ttggttggat gaggaacttt tcttatcttg ggaaagcctt aatggctttt ttttttctta 571
tttactcact cattaaaata cttttcatta ctctaacaca tgttataaag aaatagttgg 631
aaaagtgcat cgaaagactt ttaaaaatat ttggtaacta gtaaaaggac taccatcgaa 691
aatcaactca aaaaattgtc cttttatggg ttagctgtat tataatacat atctatcatt 751
tgcccctgtg tcttagagga tataatttga ccagctctac atttaatctg tgtaattatg 811
agactgtttt acaacaatct tgatgcagag ttggtaggtt aagaaatttg tattacagaa 871
gttaaaaaaa aaaaa 886



<210> 138 
<211> 1244 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 46..579

<221> sig_peptide
<222> 46..156
<223> Von Heijne matrix
score 3.5

seq LVFNFLLILTILT/IW 
<400> 138 

cccttatcca ggttnttatc tanggaatcc cnnnaagact gggga atg gag aga cag 57 

Met Glu Arg Gln
-35 



tca agg gtt atg tca gaa aag gat gag tat cag ttt caa cat can nna 105
Ser Arg Val Met Ser Glu Lys Asp Glu Tyr Gln Phe Gln His Xaa Xaa
-30 -25 -20

gcg gng gan ctg ctt gtc ttc aat ttt ttg ctc atc ctt acc att ttg 153
Ala Xaa Xaa Leu Leu Val Phe Asn Phe Leu Leu Ile Leu Thr Ile Leu
-15 -10 -5

aca atc tgg tta ttt aaa aat cat cga ttc cgc ttc ttg cat gaa act 201
Thr Ile Trp Leu Phe Lys Asn His Arg Phe Arg Phe Leu His Glu Thr
1 5 10 15

gga gga gca atg gtg tat ggc ctt ata atg gga cta att tca cga tat 249
Gly Gly Ala Met Val Tyr Gly Leu Ile Met Gly Leu Ile Ser Arg Tyr
20 25 30

i^Ct aca gca cca act gat att gaa agt gga act gtc tgt gac tgt gta 297
Thr Ala Pro Thr Asp Ile Glu Ser Gly Thr Val Cys Asp Cys Val
35 40 45

aaa cta act ttc agt cca cca act ctg ctg gtt aat gtc act gac caa 345
Lys Leu Thr Phe Ser Pro Pro Thr Leu Leu Val Asn Val Thr Asp Gln
50 55 60

=%tt 


tat 


gaa 


tat 


aaa 


tae 


aaa 


aga 


gaa 


ata 


agt 


cag 


cac 


aac 


ate 


aat 


393 


^,|^al 


Tyr 


Glu 


Tyr 


Lys 


Tyr 


Lys 


Arg 


Glu 


He 


Ser 


Gin 


His 


Asn 


He 


Asn 






65 










70 










75 












ipct 


cat 


caa 


gga 


aat 


get 


ata 


ctt 


gaa 


aag 


atg 


aea 


ttt 


gat 


cea 


gaa 


441 


'pro 


His 


Gin 


Gly 


Asn 


Ala 


He 


Leu 


Glu 


Lys 


Met 


Thr 


Phe 


Asp 


Pro 


Glu 




80 










85 










90 










95 




ate 


tte 


tte 


aat 


gtt 


tta 


ctg 


cea 


cea 


att 


ata 


ttt 


cat 


gea 


gga 


tat 


489 


lie 


Phe 


Phe 


Asn 


Val 


Leu 


Leu 


Pro 


Pro 


He 


He 


Phe 


His 


Ala 


Gly 


Tyr 












100 










105 










110 






agt 


eta 


aag 


aag 


aga 


cac 


ttt 


ttt 


caa 


aac 


tta 


gga 


tet 


att 


tta 


acg 


537 


Ser 


Leu 


Lys 


Lys 


Arg 


His 


Phe 


Phe 


Gin 


Asn 


Leu 


Gly 


Ser 


He 


Leu 


Thr 










115 










120 










125 








tat 


gee 


tte 


ttg 


gga 


act 


gee 


ate 


tee 


tge 


ate 


gtc 


ata 


ggg 






579 


Tyr 


Ala 


Phe 


Leu 


Gly 


Thr 


Ala 


He 


Ser 


Cys 


He 


Val 


He 


Gly 









130 135 140 



taagtgacat tcggagetea agttgeaggt ggctgtgggg tetgtgatct gtgtgaggga 639 

tetaaeaett ceaggattet tgetggetgg gaaaattgtc ttttttttag tatatcaeat 699 

atttgtatgt tttttctgac ttaattecac ggcttetgac aaataeaagg ettcaaatca 759 

aageaaacta gaggattget ggaetttete tgtgagttet ggaettctga cttagggaat 819 

gtggatcact tgccttgagt tatgtgaagc geattgcatt cttettttag tttgagtaat 879 

gcegatatgg teactgcatt cttttttgtc ttgtattgag agaccttace tgtatttggc 939 

aggagtgcaa aagtaactat atgecaagag ttttctttet aaaggaaagt ttaeaagaca 999 

geagtctgaa acagatatgn teeaaatatn naacagagtt gcttaataca gggatagett 1059 
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ttcagttaat accctgtaga atgcagactc tttntttcat tgtattttct tgattatgct 1119 

actgagccct aagtcacacg ttatatactc tggcttgcag ctcatcataa agtaaaatgt 1179 

ggtaccaaat ggtgaaggca atccagcctn tgataatccc gtccaataca ttaaagntcc 1239 

actgc 1244 



<210> 139 

<211> 471 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 92..469

<221> sig_peptide
<222> 92..172
<223> Von Heijne matrix
      score 7.9
      seq VVVLALGFLGCYG/AK

<221> polyA_signal
<222> 454..459

<221> polyA_site
<222> 458..471

<400> 139

fpcaagtgcag aagtcggtga cggtgggcat ctgggtgtca atcgatgggg catcctttct      60
gaagatcttc gggccactgt cgtccagtgc c atg cag ttt gtc aac gtg ggc      112
                                   Met Gln Phe Val Asn Val Gly
                                           -25

tac ttc ctc atc gca gcc ggc gtt gtg gtc ctt gct ctt ggt ttc ctg      160
Tyr Phe Leu Ile Ala Ala Gly Val Val Val Leu Ala Leu Gly Phe Leu
-20                 -15                 -10                 -5

    tgc tat ggt gct aag act gag agc atg tgt gcc ctc gtg acg ttc      208
Gly Cys Tyr Gly Ala Lys Thr Glu Ser Met Cys Ala Leu Val Thr Phe
                1               5                   10

ttc ttc atc ctc ctc ctc atc ttc att gct gag gtt gca gct gct gtg      256
Phe Phe Ile Leu Leu Leu Ile Phe Ile Ala Glu Val Ala Ala Ala Val
        15                  20                  25

gtc gcc ctg gtg tac acc aca atg gct gag cac ttc ctg acg ttg ctg      304
Val Ala Leu Val Tyr Thr Thr Met Ala Glu His Phe Leu Thr Leu Leu
    30                  35                  40

gta gtg cct gcc atc aag aaa gat tat ggt tcc cag gaa gac ttc act      352
Val Val Pro Ala Ile Lys Lys Asp Tyr Gly Ser Gln Glu Asp Phe Thr
45                  50                  55                  60

caa gtg tgg aac acc acc atg aaa ggg ctc aag tgc cgt ggc ttc acc      400
Gln Val Trp Asn Thr Thr Met Lys Gly Leu Lys Cys Arg Gly Phe Thr
                65                  70                  75

aac tat acg gat ttt gag gac tca ccc tac ttc aaa atg cat aaa cct      448
Asn Tyr Thr Asp Phe Glu Asp Ser Pro Tyr Phe Lys Met His Lys Pro
            80                  85                  90

gtt aca atg aaa aaa aaa aaa aa                                       471
Val Thr Met Lys Lys Lys Lys
        95






<210> 140 
<211> 849 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 154..675

<221> sig_peptide
<222> 154..498
<223> Von Heijne matrix
      score 4.8
      seq PLRLLNLLILIEG/GV

<221> polyA_signal
<222> 819..824

<221> polyA_site
<222> 838..849

<400> 140

cccctatctc cagacctcat tcgcaatgaa gtagaatgtc tgaaagcaga tttcaaccac       60
U'kgaatcaagg aggttctctt caactccctc ttcagtgcct actatgttgc atttctcccc     120
ctgtgttttg tgaagagtac ccagtactat gac atg cgc tgg tca tgt gag cac      174
                                     Met Arg Trp Ser Cys Glu His
                                     -115                -110

ctc gtt atg gtg tgg atc aat gct ttt gtc atg ctc acc acg caa ctg      222
Leu Val Met Val Trp Ile Asn Ala Phe Val Met Leu Thr Thr Gln Leu
            -105                -100                -95

ttg cca tcc aaa tac tgt gat ttg cta cat aaa tca gct gct cac ctg      270
Leu Pro Ser Lys Tyr Cys Asp Leu Leu His Lys Ser Ala Ala His Leu
        -90                 -85                 -80

cgc aag tgg cag aag ttg gaa cat ggg tcc tac agc aat gct cca cag      318
Arg Lys Trp Gln Lys Leu Glu His Gly Ser Tyr Ser Asn Ala Pro Gln
    -75                 -70                 -65

cac att tgg tca gaa aat aca ata tgg cct caa ggg gtg ctg gtg cgg      366
His Ile Trp Ser Glu Asn Thr Ile Trp Pro Gln Gly Val Leu Val Arg
-60                 -55                 -50                 -45

cac agc aga tgt tta tat aga gcc atg ggg cct tac aac gtg gca gtg      414
His Ser Arg Cys Leu Tyr Arg Ala Met Gly Pro Tyr Asn Val Ala Val
                -40                 -35                 -30

cct tca gat gta tct cat gcc cgc ttt tat ttc tta ttt cat cga cca      462
Pro Ser Asp Val Ser His Ala Arg Phe Tyr Phe Leu Phe His Arg Pro
            -25                 -20                 -15

tta agg ctg tta aat ctg ctc atc ctt att gag ggc ggt gtc gtc ttc      510
Leu Arg Leu Leu Asn Leu Leu Ile Leu Ile Glu Gly Gly Val Val Phe
        -10                 -5                  1

tat cag ctc tat tcc ttg ctg cgg tcg gag aag tgg aac cac aca ctt      558
Tyr Gln Leu Tyr Ser Leu Leu Arg Ser Glu Lys Trp Asn His Thr Leu
5                   10                  15                  20

tcc atg gct ctc atc ctc ttc tgc aac tac tat gtt tta ttt aaa ctt      606
Ser Met Ala Leu Ile Leu Phe Cys Asn Tyr Tyr Val Leu Phe Lys Leu
                25                  30                  35

ctc cgg gac aga ata gta tta ggc agg gca tac tcc tac cca ctc aac      654
Leu Arg Asp Arg Ile Val Leu Gly Arg Ala Tyr Ser Tyr Pro Leu Asn
            40                  45                  50

agt tat gaa ctc aag gca aac taagctgcct ctcaacaatg agggagaact          705
Ser Tyr Glu Leu Lys Ala Asn
        55

cagataaaaa tattttcata cgttctattt ttttcttgtg atttttataa atatttaaga      765
tgttttatat tttgtatact attatgtttt gaaagtcggg aagagtaagg gatattaaat      825
gtatccgtaa acaaaaaaaa aaaa                                             849



<210> 141 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -31..-1

<400> 141

Met Phe Thr Ser Thr Gly Ser Ser Gly Leu Tyr Lys Ala Pro Leu Ser
    -30                 -25                 -20

Lys Ser Leu Leu Leu Val Pro Ser Ala Leu Ser Leu Leu Leu Ala Leu
-15                 -10                 -5                  1

Leu Leu Pro His Cys Gln Lys Pro Phe Val Tyr Asp Leu His Ala Val
            5                   10                  15

Lys Asn Asp Phe Gln Ile Trp Arg Leu Ile Cys Gly Arg Ile Ile Cys
        20                  25                  30

Leu Asp Leu Lys Asp Thr Phe Cys Ser Ser Leu Leu Ile Tyr Asn Phe
    35                  40                  45

Arg Ile Phe Glu Arg Arg Tyr Gly Ser Arg Lys Phe Ala Ser Phe Leu
50                  55                  60                  65

Leu Gly Thr Trp Val Leu Ser Ala Leu Phe Asp Phe Leu Leu Ile Glu
                70                  75                  80

Ala Met Gln Tyr Phe Phe Gly Ile Thr Ala Ala Ser Asn Leu Pro Ser
            85                  90                  95

Gly Leu Ile Phe Cys Cys Ala Phe Cys Ser Glu Thr Lys Leu Phe Leu
        100                 105                 110

Ser Arg Gln Ala Met Ala Glu Asn Phe Ser Ile
    115                 120






<210> 142 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<400> 142 

Met Ala Asp Phe Tyr Lys Glu Phe Leu Ser Lys Asn Phe Gln Lys Arg
1               5                   10                  15

Met Tyr Tyr Asn Arg Asp Trp Tyr Lys Arg Asn Phe Ala Ile Thr Phe
            20                  25                  30

Phe Met Gly Lys Val Ala Leu Glu Arg Ile Trp Asn Lys Leu Lys Gln
        35                  40                  45

Lys Gln Lys Lys Arg Ser Asn
    50                  55



<210> 143 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -20..-1

<400> 143

Met Ser Arg Asn Leu Arg Thr Ala Leu Ile Phe Gly Gly Phe Ile Ser
-20                 -15                 -10                 -5

Leu Ile Gly Ala Ala Phe Tyr Pro Ile Tyr Phe Arg Pro Leu Met Arg
                1               5                   10

Leu Glu Glu Tyr Lys Lys Glu Gln Ala Ile Asn Arg Ala Gly Ile Val
        15                  20                  25

Gln Glu Asp Val Gln Pro Pro Gly Leu Lys Val Trp Ser Asp Pro Phe
    30                  35                  40

Gly Arg Lys
45


















<210> 144
<211> 198
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -21..-1

<400> 144

Met Pro Val Pro Ala Leu Cys Leu Leu Trp Ala Leu Ala Met Val Thr
    -20                 -15                 -10

Arg Pro Ala Ser Ala Ala Pro Met Gly Gly Pro Glu Leu Ala Gln His
-5                  1               5                   10

Glu Glu Leu Thr Leu Leu Phe His Gly Thr Leu Gln Leu Gly Gln Ala
            15                  20                  25

Leu Asn Gly Val Tyr Arg Thr Thr Glu Gly Trp Leu Thr Lys Ala Arg
        30                  35                  40

Asn Ser Leu Gly Leu Tyr Gly Arg Thr Ile Glu Leu Leu Gly Gln Glu
    45                  50                  55

Val Ser Arg Gly Arg Asp Ala Ala Gln Glu Leu Arg Ala Ser Leu Leu
60                  65                  70                  75

Glu Thr Gln Met Glu Glu Asp Ile Leu Gln Leu Gln Ala Glu Ala Thr
                80                  85                  90

Ala Glu Val Leu Gly Glu Val Ala Gln Ala Gln Lys Val Leu Arg Asp
            95                  100                 105

Ser Val Gln Arg Leu Glu Val Gln Leu Arg Ser Ala Trp Leu Gly Pro
        110                 115                 120

Ala Tyr Arg Glu Phe Glu Val Leu Lys Ala His Ala Asp Lys Gln Ser
    125                 130                 135

His Ile Leu Trp Ala Leu Thr Gly His Val Gln Arg Gln Arg Arg Glu
140                 145                 150                 155

Met Val Ala Gln Gln His Arg Leu Arg Gln Ile Gln Glu Arg Leu His
                160                 165                 170

Thr Ala Ala Leu Pro Ala
            175



<210> 145 
<211> 135 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL
<222> -25..-1

<400> 145

Met Ser Leu Arg Asn Leu Trp Arg Asp Tyr Lys Val Leu Val Val Met
                    -20                 -15                 -10

Val Pro Leu Val Gly Leu Ile His Leu Gly Trp Tyr Arg Ile Lys Ser
                -5                  1               5

Ser Pro Val Phe Gln Ile Pro Lys Asn Asp Asp Ile Pro Glu Gln Asp
        10                  15                  20

    Leu Gly Leu Ser Asn Leu Gln Lys Ser Gln Ile Gln Gly Lys Xaa
    25                  30                  35

Ala Gly Leu Gln Ser Ser Gly Lys Glu Ala Ala Leu Asn Leu Ser Phe
40                  45                  50                  55

Ile Ser Lys Glu Glu Met Lys Asn Thr Ser Trp Ile Arg Lys Asn Trp
                60                  65                  70

Leu Leu Val Ala Gly Ile Ser Phe Ile Gly Asp His Leu Gly Thr Tyr
            75                  80                  85

Phe Leu Gln Arg Ser Ala Lys Gln Ser Val Lys Phe Gln Ser Gln Ser
        90                  95                  100

    Gln Lys Ser Ile Glu Glu
    105                 110











<210> 146 
<211> 255 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -70..-1

<400> 146

Met Gln Gln Lys Glu Gln Gln Phe Arg Glu Trp Phe Leu Lys Glu Phe
-70                 -65                 -60                 -55

Pro Gln Ile Arg Trp Lys Ile Gln Glu Ser Ile Glu Arg Leu Arg Val
                -50                 -45                 -40

Ile Ala Asn Glu Ile Glu Lys Val His Arg Gly Cys Val Ile Ala Asn
            -35                 -30                 -25

V Ct JL


V CI -L 


Ser Gly 


OCX 


Thr Gly He 


IjCU. Ocl. VCtX 


He Gly Val Met Leu 






-20 










-15 






-10 










Phe 


Thr 


i-iXCt. 


Gly Leu Ser 


JLicU. OCX XXC 


Thr Ala Ala Gly Val 














T 
X 




IT 








XvJ 




Leu 


Gly He 


^xa 


Ser 


ii±a 


inr 


7\1iJ r*l-^r TTi=i 

/\j.a oxy xxe 


Ala 


Ser 


Ser 


He Val 


















^ u 








^ o 




x^toli 


iiir 


Tyr 


X XIJL 


Arg 


Ser 


ii±a 


oxu. xicu mx 


Ala 


Ser Arg 


Leu Thr 








J u 










oO 






40 




Ala 




Ser 


Thr 




Gin 


Leu 


Glu 


nXd Xitr U. i-iX^ 


Asp 


He 


Leu 


His Asp 
















50 






55 






lie 


Thir 


Pro 


Asn 


Val 


Leu 


Ser 


Phe 


£-lXCt XlCU. zn,to^ 


Pne 


Asp 


Glu 


Ala Thr 




o u 










65 






•7 n 

/ U 








T ,\TC* 

xjy to 




lie 


Ala 


Asn 


Asp 


Val 


His 


iiix xicu i^rg 


Arg 


Ser 


Lys 


Ala Thr 


/ D 










80 






bo 








90 


V d-L 




Arg 


Pro 


Leu 


He 


Ala 


Trp 


iixg lyr vax 


Pro 


He 


Asn 


Val Val 










y b 








100 








105 


r*l 11 




Leu 


Arg 


Tlir 


Arg Gly Ala 


Pro Thr Arg 


He 


Val 


Arg 


Lys Val 








110 










115 






120 




i-i-Lct 


Arg 


Asn 


Leu 


vjXy 


Lys 


Ala 


Thr 


oer v^xy vax 


Leu 


Val 


Val 


Leu Asp 






125 










130 






135 






Val 


Val 


Asn 


Leu 


Val 


Gin Asp 


Ser 


Leu Asp Leu 


His 


Lys 


Gly Glu Lys 




140 










145 






150 










Glu 


Ser 


Ala 


Glu 


Leu 


Leu Arg 


Gin Trp Ala 


Gin 


Glu 


Leu 


Glu Glu 


|;|L55 










160 






165 








170 




Leu 


Asn 


Glu 


Leu 


Thr 


His 


He 


His Gin Ser 


Leu Lys Ala Gly 










175 








180 








185 



<210> 147
<211> 59
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -49..-1

<400> 147

Met Pro Gly Thr Glu Val Leu Glu Gly Ala Thr Asp Gly Leu Ala Ala
                -45                 -40                 -35

Ile Asn Leu Leu Lys Trp Ile Lys Thr Leu Gly Gly Ser Val Ile Ser
            -30                 -25                 -20

Met Ile Val Leu Leu Ile Cys Val Val Cys Leu Tyr Ile Val Cys Arg
        -15                 -10                 -5

Cys Gly Ser His Leu Trp Arg Glu Ser His His
    1               5                   10



<210> 148 
<211> 180 
<212> PRT 
<213> Homo sapiens 

<400> 148 

Met Cys Ile Ser Gly Leu Cys Gln Ile Val Gly Cys Asp His Gln Leu
1








c 
D 




















±o 




Ser 


Thr 


20 


Lys 


VjXU 


Asp 


Asn 


Cys 

ZD 




V a.X 


\--yo 


Asn 






Ser 


Thr 


Cys 
35 


Arg 


Leu 


Val 


Arg 


Gly 
40 


Gin 


Tyr 


Lys 


Ser 


Gin 
45 


Leu 


Ser Ala 


Thr 


Lys 
50 


Ser 


Asp 


Asp 


Thr 


Val 
55 


Val 


Ala 


He 


Pro 


Tyr 
60 


Gly 


Ser 


Arg His 


He 


Arg 


Leu 


Val 


Leu 


Lys 


Gly 


Pro 


Asp 


His 


Leu 


Tyr 


Leu 


Glu 


Thr Lys 


65 










70 










75 








80 


Thr 


Leu 


Gin 


Gly 


Thr 
85 


Lys 


Gly 


Glu 


Asn 


Ser 

9 0 


Leu 


Ser 


Ser 


Thr 


Gly Thr 

95 




Leu 


V3i± 


Asp 
100 


Asn 


Ser 


Ser 


va J. 


Asp 
105 


Fne 


tjixn 


Lys 


jfne 


Pro 
110 


Asp Jbys 


Glu 


He 


Leu 
115 


Arg 


Met 


Ala 


Gly 


Pro 
120 


Leu 


Thr 


Ala 


Asp 


Phe 
125 


He 


Val Lys 


He 


Arg 
130 


Asn 


Ser 


Gly 


Ser 


Ala 
135 


Asp 


Ser 


Thr 


Val 


Gin 
140 


Phe 


He 


Phe Tyr 


Gin 


Pro 


He 


He 


His 


Arg 


Trp 


Arg 


Glu 


Thr 


Asp 


Phe 


Phe 


Pro 


Cys Ser 


145 










150 










155 








160 


Ala 


Thr 


Cys 


Gly 


Gly 
165 


Gly 


Tyr 


Gin 


Leu 


Thr 
170 


Ser 


Ala 


Glu 


Cys 


Tyr Asp 
175 



Leu Arg Ser Asn



<210> 149
<211> 162
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1

<400> 149

Met Gly Asp Lys Ile Trp Leu Pro Phe Pro Val Leu Leu Leu Ala Ala
            -20                 -15                 -10

Leu Pro Pro Val Leu Leu Pro Gly Ala Ala Gly Phe Thr Pro Ser Leu
        -5                  1               5

Asp Ser Asp Phe Thr Phe Thr Leu Pro Ala Gly Gln Lys Glu Cys Phe
10                  15                  20                  25

Tyr Gln Pro Met Pro Leu Lys Ala Ser Leu Glu Ile Glu Tyr Gln Val
                30                  35                  40

Leu Asp Gly Ala Gly Leu Asp Ile Asp Phe His Leu Ala Ser Pro Glu
            45                  50                  55

Gly Lys Thr Leu Val Phe Glu Gln Arg Lys Ser Asp Gly Val His Thr
        60                  65                  70

Val Glu Thr Glu Val Gly Asp Tyr Met Phe Cys Phe Asp Asn Thr Phe
    75                  80                  85

Ser Thr Ile Ser Glu Lys Val Ile Phe Phe Glu Leu Ile Pro Asp Asn
90                  95                  100                 105

Met Gly Glu Gln Ala Gln Glu Gln Glu Asp Trp Lys Lys Tyr Ile Thr
                110                 115                 120

Gly Thr Asp Ile Leu Asp Met Lys Leu Glu Asp Ile Leu Val Ser Met
            125                 130                 135

Val Phe

<210> 150
<211> 120
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -23..-1

<400> 150

Met Gly Asp Lys Ile Trp Leu Pro Phe Pro Val Leu Leu Leu Ala Ala
            -20                 -15                 -10

Leu Pro Pro Val Leu Leu Pro Gly Ala Ala Gly Phe Thr Pro Ser Leu
        -5                  1               5

Asp Ser Asp Phe Thr Phe Thr Leu Pro Ala Gly Gln Lys Glu Cys Phe
10                  15                  20                  25

Tyr Gln Pro Met Pro Leu Lys Ala Ser Leu Glu Ile Glu Tyr Gln Val
                30                  35                  40

Leu Asp Gly Ala Gly Leu Asp Ile Asp Phe His Leu Ala Ser Pro Glu
            45                  50                  55

Gly Lys Thr Leu Val Phe Glu Gln Arg Lys Ser Asp Gly Val His Thr
        60                  65                  70

Lys Ile Arg Ser Lys Asn Gly Pro Gly Thr Ala Val His Ala Tyr Asn
    75                  80                  85

Pro Ser Thr Phe Arg Gly Gln Val
90                  95



<210> 151
<211> 7
<212> PRT
<213> Homo sapiens

<400> 151

Met Val Glu Met Thr Gly Val
1               5



<210> 152 
<211> 199 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -42..-1

<400> 152

Met Asp Gly Gln Lys Lys Asn Trp Lys Asp Lys Val Val Asp Leu Leu
        -40                 -35                 -30

Tyr Trp Arg Asp Ile Lys Lys Thr Gly Val Val Phe Gly Ala Ser Leu
    -25                 -20                 -15

Phe Leu Leu Leu Ser Leu Thr Val Phe Ser Ile Val Ser Val Thr Ala
-10                 -5                  1               5

Tyr Ile Ala Leu Ala Leu Leu Ser Val Thr Ile Ser Phe Arg Ile Tyr
            10                  15                  20

Lys Gly Val Ile Gln Ala Ile Gln Lys Ser Asp Glu Gly His Pro Phe
        25                  30                  35

Arg Ala Tyr Leu Glu Ser Glu Val Ala Ile Ser Glu Glu Leu Val Gln
    40                  45                  50

Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr Ile Lys Glu
55                  60                  65                  70

Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser Leu Lys Phe
                75                  80                  85

Ala Val Leu Met Trp Val Phe Thr Tyr Val Gly Ala Leu Phe Asn Gly
            90                  95                  100

Leu Thr Leu Leu Ile Leu Ala Leu Ile Ser Leu Phe Ser Val Pro Val
        105                 110                 115

Ile Tyr Glu Arg His Gln Ala Gln Ile Asp His Tyr Leu Val Leu Ala
    120                 125                 130

Asn Lys Asn Val Lys Asp Ala Met Ala Lys Ile Gln Ala Lys Ile Pro
135                 140                 145                 150

Gly Leu Lys Arg Lys Ala Glu
                155



<210> 153
<211> 43
<212> PRT
<213> Homo sapiens

<400> 153

Met Pro Phe Arg Met Ser Gly Tyr Ile Pro Phe Gly Thr Pro Ile Val
1               5                   10                  15

Ser Val Thr Phe Lys Gly Phe Pro Phe Leu Lys Asn Tyr Phe Lys Cys
            20                  25                  30

Leu Thr Leu Cys Tyr Cys Ser Arg Val Phe Asp
        35                  40



<210> 154 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL
<222> -37..-1

<400> 154

Met Glu Trp Ala Gly Lys Gln Arg Asp Phe Gln Val Arg Ala Ala Pro
        -35                 -30                 -25

Gly Trp Asp His Leu Ala Ser Phe Pro Gly Pro Ser Leu Arg Leu Phe
    -20                 -15                 -10

Ser Gly Ser Gln Ala Ser Val Cys Ser Leu Cys Ser Gly Phe Gly Ala
-5                  1               5                   10

Gln Glu

<210> 155
<211> 153
<212> PRT
<213> Homo sapiens

<400> 155

Thr Val Pro Leu Leu Leu Glu Pro Ala Asp His Ala Arg Gly Arg Ala
1               5                   10                  15

His Val His Leu Pro Glu Asn Val Arg Ser Gln Ser Pro Gly His Val
            20                  25                  30

Arg Arg Gly Arg Ser Gly Ala Gln Val Leu Pro Thr Gly Pro Asp Glu
        35                  40                  45

Lys Gln Val Glu Lys Ser Glu Val Asp Phe Ser Lys Ser His Ser Leu
    50                  55                  60

Val Arg Arg Phe Glu Asp Leu Lys Pro Lys Leu Ser Val Cys Lys Thr
65                  70                  75                  80

Gly Ser Gln Val Phe Arg Ser Glu Asn Trp Lys Val Trp Ala Glu Ser
                85                  90                  95

Ser Arg Gly Asp His Asp Asp Cys Leu Asp Leu Cys Ser Val Leu Cys
            100                 105                 110

Trp Gly Glu Leu Leu Arg Thr Ile Pro Glu Ile Pro Pro Lys Arg Gly
        115                 120                 125

Glu Leu Lys Thr Glu Leu Leu Gly Leu Lys Glu Arg Lys His Lys Pro
    130                 135                 140

Gln Val Ser Gln Gln Glu Glu Leu Lys
145                 150


















<210> 156
<211> 67
<212> PRT
<213> Homo sapiens

<400> 156

Met Arg Gln Lys Arg Lys Gly Asp Leu Ser Pro Ala Lys Leu Met Met
1               5                   10                  15

Leu Thr Ile Gly Asp Val Ile Lys Gln Leu Ile Glu Ala His Glu Gln
            20                  25                  30

Gly Lys Asp Ile Asp Leu Asn Lys Val Arg Thr Lys Thr Ala Ala Lys
        35                  40                  45

Tyr Gly Leu Ser Ala Gln Pro Arg Leu Val Asp Ile Ile Ala Ala Val
    50                  55                  60

Pro Pro Glu
65



<210> 157 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<400> 157

Met Asp Glu Leu Ser Glu Glu Asp Lys Leu Thr Val Ser Arg Ala Arg
1               5                   10                  15

Lys Ile Gln Arg Phe Leu Ser Gln Pro Phe Gln Val Ala Glu Val Phe
            20                  25                  30

Thr Gly His Met Gly Lys Leu Val Pro Leu Lys Glu Thr Ile Lys Gly
        35                  40                  45

Phe Gln Gln Ile Leu Ala Gly Glu Tyr Asp His Leu Pro Glu Gln Ala
    50                  55                  60

Phe Tyr Met Val Gly Pro Ile Glu Glu Ala Val Ala Lys Ala Asp Lys
65                  70                  75                  80

Leu Ala Glu Glu His Ser Ser
                85



<210> 158 
<211> 250 
<212> PRT 
<213> Homo sapiens 

<220>
<221> SIGNAL
<222> -85..-1

<400> 158

Met Ser Ala Glu Val Lys Val Thr Gly Gln Asn Gln Glu Gln Phe Leu
-85                 -80                 -75                 -70

Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu Ile His
                -65                 -60                 -55

Gln Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu Leu Leu Asp
            -50                 -45                 -40

Met Pro Asn Val Arg Glu Leu Xaa Ala Arg Asn Leu Pro Pro Leu Thr
        -35                 -30                 -25

Glu Ala Gln Lys Asn Lys Leu Arg His Leu Ser Val Val Thr Leu Ala
    -20                 -15                 -10

Ala Lys Val Lys Cys Ile Pro Tyr Ala Val Leu Leu Glu Ala Leu Ala
-5                  1               5                   10

Leu Arg Asn Val Arg Gln Leu Glu Asp Leu Val Ile Glu Ala Val Tyr
            15                  20                  25

Ala Asp Val Leu Arg Gly Ser Leu Asp Gln Arg Asn Gln Arg Leu Glu
        30                  35                  40

Val Asp Tyr Ser Ile Gly Arg Asp Ile Gln Arg Gln Asp Leu Ser Ala
    45                  50                  55

Ile Ala Arg Thr Leu Gln Glu Trp Cys Val Gly Cys Glu Val Val Leu
60                  65                  70                  75

Ser Gly Ile Glu Glu Gln Val Ser Arg Ala Asn Gln His Lys Glu Gln
                80                  85                  90

Gln Leu Gly Leu Lys Gln Gln Ile Glu Ser Glu Val Ala Asn Leu Lys
            95                  100                 105

Lys Thr Ile Lys Val Thr Thr Ala Ala Ala Ala Ala Ala Thr Ser Gln
        110                 115                 120

Asp Pro Glu Gln His Leu Thr Glu Leu Arg Glu Pro Ala Pro Gly Thr
    125                 130                 135

Asn Gln Arg Gln Pro Ser Lys Lys Ala Ser Lys Gly Lys Gly Leu Arg
140                 145                 150                 155

Gly Ser Ala Lys Ile Trp Ser Lys Ser Asn
                160                 165

<210> 159
<211> 24
<212> PRT
<213> Homo sapiens

<400> 159

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Thr Thr Tyr Asn Lys
1               5                   10                  15

His Ile Asn Ile Ser Phe His Arg
            20



<210> 160 

<211> 228 

<212> PRT 

<213> Homo sapiens 



<400> 160

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Thr Thr Tyr Asn Lys
1               5                   10                  15

His Ile Asn Ile Ser Phe His Arg Phe Pro Leu Asp Pro Lys Arg Arg
            20                  25                  30

Lys Glu Trp Val Arg Leu Val Arg Arg Lys Asn Phe Val Pro Gly Lys
        35                  40                  45

His Thr Phe Leu Cys Ser Lys His Phe Glu Ala Ser Cys Phe Asp Leu
    50                  55                  60

Thr Gly Gln Thr Arg Arg Leu Lys Met Asp Ala Val Pro Thr Ile Phe
65                  70                  75                  80

Asp Phe Cys Thr His Ile Lys Ser Met Lys Leu Lys Ser Arg Asn Leu
                85                  90                  95

Leu Lys Lys Asn Asn Ser Cys Ser Pro Ala Gly Pro Ser Ser Leu Lys
            100                 105                 110

Ser Asn Ile Ser Ser Gln Gln Val Leu Leu Glu His Ser Tyr Ala Phe
        115                 120                 125

Arg Asn Pro Met Glu Ala Lys Lys Arg Ile Ile Lys Leu Glu Lys Glu
    130                 135                 140

Ile Ala Ser Leu Arg Arg Lys Met Lys Thr Cys Leu Gln Lys Glu Arg
145                 150                 155                 160

Arg Ala Thr Arg Arg Trp Ile Lys Ala Met Cys Leu Val Lys Asn Leu
                165                 170                 175

Glu Ala Asn Ser Val Leu Pro Lys Gly Thr Ser Glu His Met Leu Pro
            180                 185                 190

Thr Ala Leu Ser Ser Leu Pro Leu Glu Asp Phe Lys Ile Leu Glu Gln
        195                 200                 205

Asp Gln Gln Asp Lys Thr Leu Leu Ser Leu Asn Leu Lys Gln Thr Lys
    210                 215                 220

Ser Thr Phe Ile
225



<210> 161 
<211> 86 
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -20..-1

<400> 161

Met Asn Leu His Phe Pro Gln Trp Phe Val His Ser Ser Ala Leu Gly
-20                 -15                 -10                 -5

Leu Val Leu Ala Pro Pro Phe Ser Ser Pro Gly Thr Asp Pro Thr Phe
                1               5                   10

Pro Cys Ile Tyr Cys Arg Leu Leu Asn Met Ile Met Thr Arg Leu Ala
        15                  20                  25

Phe Ser Phe Ile Thr Cys Leu Cys Pro Asn Leu Lys Glu Val Cys Leu
    30                  35                  40

Ile Leu Pro Glu Lys Asn Cys Asn Ser Arg His Ala Gly Phe Val Gly
45                  50                  55                  60

Pro Ala Lys Leu Arg Gln
                65



<210> 162
<211> 44
<212> PRT
<213> Homo sapiens

<400> 162

Met Ser Pro Arg Leu Glu Cys Ser Gly Ala Ile Leu Ala His Cys Asn
1               5                   10                  15

Pro Arg Leu Pro Gly Ser Ser Tyr Ser Pro Ala Ser Ala Thr Trp Val
            20                  25                  30

Arg Gly Ser Leu Glu Pro Gly Arg Leu Arg Leu Gln
        35                  40



<210> 163
<211> 314
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -58..-1

<400> 163

Met Gln Asn Val Ile Asn Thr Val Lys Gly Lys Ala Leu Glu Val Ala
            -55                 -50                 -45

Glu Tyr Leu Thr Pro Val Leu Lys Glu Ser Lys Phe Arg Glu Thr Gly
        -40                 -35                 -30

Val Ile Thr Pro Glu Glu Phe Val Ala Ala Gly Asp His Leu Val His
    -25                 -20                 -15

His Cys Pro Thr Trp Gln Trp Ala Thr Gly Glu Glu Leu Lys Val Lys
-10                 -5                  1               5

Ala Tyr Leu Pro Thr Gly Lys Gln Phe Leu Val Thr Lys Asn Val Pro
            10                  15                  20


Cys 


Tvr* 


i-i_y o 


i\xg 


Cys Lys 


Gin 


Fieu 


VaXU 


Tyr 


Ser 


Asp 


Glu Leu Glu Ala 














30 










35 


lie 


lie 


Glu 


ox LL 


Asp Asp Gly 


Asp 


Gly 


Gly Trp Val 


Asp Thr Tyr His 




40 








45 










50 


Asn 


ThlT 


Gly 


lie 


Thr Gly He 


Thr 


Glu 


Ala 


Val 


Lys 


Glu He Thr Leu 










60 










65 




70 


Glu 


Asn 


Lys 




Asn He 


Arg 


Leu 


Gin 


Asp 


Cys 


Ser 


Ala Leu Cys Glu 










75 








80 






85 


Glu 


Glu 


Glu 




Glu Asp Glu 


Gly Glu 


Ala 


Ala 


Asp 


Met Glu Glu Tyr 








QPl 








95 








100 


Glu 


Glu 


OCX, 


vjx y 


Leu Leu 


Glu 


Thr 


Asp 


Glu 


Ala 


Thr 


Leu Asp Thr Arg 






lUb 








110 










115 


Lsys 


± ±e 


Va.X 


oXU 


Ala Cys 


Lys 


Ala 


Lys 


Thr 


Asp 


Ala 


Gly Gly Glu Asp 




x<s u 








125 










130 




J. 


Leu 


vjxn 


Thr Arg 


Thr 


Tyr 


Asp 


Leu 


Tyr 


He 


Thr Tyr Asp Lys 


135 








140 










145 




150 


■'■y-'- 


Tvr* 




1 111. 


Pro Arg 


Leu 


Trp 


Leu 


Phe 


Gly Tyr 


Asp Glu Gin Arg 










155 








160 






165 


m n 




Leu 


TxiuT 


Val Glu 


His 


Met 


Tyr 


Glu 


Asp 


He 


Ser Gin Asp His 








X / (J 








175 








180 




Lys 


Lys 


Thr 


Val Thr 


He 


Glu 


Asn 


His 


Pro 


His 


Leu Pro Pro Pro 






185 








190 










195 


Met 


Cys 


Ser 


Val His 


Pro 


Cys Arg 


His 


Ala 


Glu 


vax iviet- Xjys Lys 




200 








205 










210 


^lltle 


lie 


Glu 


Thr 


Val Ala 


Glu 


Gly Gly 


Gly Glu Leu 


Gly Val His Met 










220 










225 




230 




Leu 


Leu 


He 


Phe Leu 


Lys 


Phe 


Val 


Gin 


Ala 


Val 


He Pro Thr He 


;!&lU 








235 








240 






245 


Tyr 


Asp 


Tyr 


Thr Arg His 


Phe 


Thr 


Met 









<210> 164
<211> 89
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -80..-1

<400> 164

Met Arg Thr Arg Thr Thr Gly Asn Pro Arg Gly Leu His Asp Thr Phe
-80                 -75                 -70                 -65

Pro Arg Arg Pro Arg Leu Gly Arg Cys Ser Asp Met Asp Thr Ala Arg
                -60                 -55                 -50

Thr Ser Cys Ser Asp Leu Leu Pro Trp Glu Gly Val Thr Glu Pro Ala
            -45                 -40                 -35

Leu Cys Gly Asp Gln Leu Gln Gly Thr Glu Gly Trp Leu Glu Ala Thr
        -30                 -25                 -20

Gln Leu Gly Arg Gly Leu Leu Ser Ala Cys Ala Pro Trp Gly Asp Gly
    -15                 -10                 -5

Ser Thr Gln Pro Val Pro Leu Cys Ser
1               5

<210> 165
<211> 98
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<400> 165

Met Glu Ala Met Trp Leu Leu Cys Val Ala Leu Ala Val Leu Ala Trp
-15                 -10                 -5                  1

Gly Phe Leu Trp Val Trp Asp Ser Ser Glu Arg Met Lys Ser Arg Glu
            5                   10                  15

Gln Gly Gly Arg Leu Gly Ala Glu Ser Arg Thr Leu Leu Val Ile Ala
        20                  25                  30

His Pro Asp Asp Glu Ala Met Phe Phe Ala Pro Thr Val Leu Gly Leu
    35                  40                  45

Ala Arg Leu Arg His Trp Val Tyr Leu Leu Cys Phe Ser Ala Val Phe
50                  55                  60                  65

Arg Arg Glu Leu Ser Glu Tyr Thr Glu Gly Leu Thr Ser Glu Pro Leu
                70                  75                  80

Thr Ala



<210> 166
<211> 92
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -36..-1

<400> 166

Met Leu Val Thr Gln Gly Leu Val Tyr Gln Gly Tyr Leu Ala Ala Asn
    -35                 -30                 -25

Ser Arg Phe Gly Ser Leu Pro Lys Val Ala Leu Ala Gly Leu Leu Gly
-20                 -15                 -10                 -5

Phe Gly Leu Gly Lys Val Ser Tyr Ile Gly Val Cys Gln Ser Lys Phe
                1               5                   10

His Phe Phe Glu Asp Gln Leu Arg Gly Ala Gly Phe Gly Pro Gln His
        15                  20                  25

Asn Arg His Cys Leu Leu Thr Cys Glu Glu Cys Lys Ile Lys His Gly
    30                  35                  40

Leu Ser Glu Lys Gly Asp Ser Gln Pro Ser Ala Ser
45                  50                  55



<210> 167 
<211> 351 
<212> PRT 

<213> Homo sapiens 
<220>
<221> SIGNAL
<222> -16..-1

<400> 167

Met Val Pro Phe Ile Tyr Leu Gln Ala His Phe Thr Leu Cys Ser Gly
    -15                 -10                 -5

Trp Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr
1               5                   10                  15

Thr Gln Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val Ser Ile
            20                  25                  30

Pro His Gly Pro Asn Val Thr Val Arg Ala Asn Ile Ala Ala Ile Thr
        35                  40                  45

Glu Ser Asp Lys Phe Phe Ile Asn Gly Ser Asn Trp Glu Gly Ile Leu
    50                  55                  60

Gly Leu Ala Tyr Ala Glu Ile Ala Arg Pro Asp Asp Ser Pro Glu Pro
65                  70                  75                  80

Phe Phe Asp Ser Leu Val Lys Gln Thr His Val Pro Asn Leu Phe Ser
                85                  90                  95

Leu Gln Leu Cys Gly Ala Gly Phe Pro Leu Asn Gln Ser Glu Val Leu
            100                 105                 110

Ala Ser Val Gly Gly Ser Met Ile Ile Gly Gly Ile Asp His Ser Leu
        115                 120                 125

Tyr Thr Gly Ser Leu Trp Tyr Thr Pro Ile Arg Arg Glu Trp Tyr Tyr
    130                 135                 140

Glu Val Ile Ile Val Arg Val Glu Ile Asn Gly Gln Asp Leu Lys Met
145                 150                 155                 160

Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser Ile Val Asp Ser Gly Thr
                165                 170                 175

Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser
            180                 185                 190

Ile Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu
        195                 200                 205

Gly Glu Gln Leu Val Cys Trp Gln Ala Gly Thr Thr Pro Trp Asn Ile
    210                 215                 220

Phe Pro Val Ile Ser Leu Tyr Leu Met Gly Glu Val Thr Asn Gln Ser
225                 230                 235                 240

Phe Arg Ile Thr Ile Leu Pro Gln Gln Tyr Leu Arg Pro Val Glu Asp
                245                 250                 255

Val Ala Thr Ser Gln Asp Asp Cys Tyr Lys Phe Ala Ile Ser Gln Ser
            260                 265                 270

Ser Thr Gly Thr Val Met Gly Ala Val Ile Met Glu Gly Phe Tyr Val
        275                 280                 285

Val Phe Asp Arg Ala Arg Lys Arg Ile Gly Phe Ala Val Ser Ala Cys
    290                 295                 300

His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Cys
305                 310                 315                 320

His Leu Gly His Gly Arg Leu Trp Leu Gln His Ser Thr Asp Arg
                325                 330                 335



<210> 168 
<211> 138 
<212> PRT 
<213> Homo sapiens 
<220>
<221> SIGNAL
<222> -47..-1

<400> 168

Met Glu Lys Phe Val Asp Pro Gly Asn His Asn Ser Gly Ile Asp Leu
        -45                 -40                 -35

Leu Arg Thr Tyr Leu Trp Arg Cys Gln Phe Leu Leu Pro Phe Val Ser
    -30                 -25                 -20

Leu Gly Leu Met Cys Phe Gly Ala Leu Ile Gly Leu Cys Ala Cys Ile
-15                 -10                 -5                  1

Cys Arg Ser Leu Tyr Pro Thr Ile Ala Thr Gly Ile Leu His Leu Leu
            5                   10                  15

Ala Gly Leu Cys Thr Leu Gly Ser Val Ser Cys Tyr Val Ala Gly Ile
        20                  25                  30

Glu Leu Leu His Gln Lys Leu Glu Leu Pro Asp Asn Val Ser Gly Glu
    35                  40                  45

Phe Gly Trp Ser Phe Cys Leu Ala Cys Val Ser Ala Pro Leu Gln Phe
50                  55                  60                  65

Met Ala Ser Ala Leu Phe Ile Trp Ala Ala His Thr Asn Arg Arg Glu
                70                  75                  80

Tyr Thr Leu Met Lys Ala Tyr Arg Val Ala
            85                  90


<210> 169
<211> 101
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -73..-1

<400> 169

Met Asn Leu Glu Arg Val Ser Asn Glu Glu Lys Leu Asn Leu Cys Arg
            -70                 -65                 -60

Lys Tyr Tyr Leu Gly Gly Phe Ala Phe Leu Pro Phe Leu Trp Leu Val
        -55                 -50                 -45

Asn Ile Phe Trp Phe Tyr Arg Glu Ala Phe Leu Val Pro Ala Tyr Thr
    -40                 -35                 -30

Glu Gln Ser Gln Ile Lys Gly Tyr Val Trp Arg Ser Ala Val Gly Phe
-25                 -20                 -15                 -10

Leu Phe Trp Val Ile Val Leu Thr Ser Trp Ile Thr Ile Phe Gln Ile
                -5                  1               5

Tyr Arg Pro Arg Trp Gly Ala Leu Gly Asp Tyr Leu Ser Phe Thr Ile
        10                  15                  20

Pro Leu Gly Thr Pro
    25



<210> 170 
<211> 252 
<212> PRT 
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -68..-1

<400> 170

Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
            -65                 -60                 -55

Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
        -50                 -45                 -40

Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
    -35                 -30                 -25

Ala Ser Ala Arg Gly Lys Glu Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20                 -15                 -10                 -5

Gly Ala Gln Pro Gln Gln Glu Pro Leu Ala Leu Val Phe Arg Phe Gly
                1               5                   10

Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro Arg His
        15                  20                  25

Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
    30                  35                  40

Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45                  50                  55                  60

Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
                65                  70                  75

Arg Glu Asn Val Leu Arg Asn Leu Ala Asp Lys Ala Phe Asp Arg Pro
            80                  85                  90

Ile Cys Glu Ala Leu Leu Asp Gln Arg Phe Phe Asn Gly Ile Gly Asn
        95                  100                 105

Tyr Leu Arg Ala Glu Ile Leu Tyr Arg Leu Lys Ile Pro Pro Phe Glu
    110                 115                 120

Lys Ala Arg Ser Val Leu Glu Ala Leu Gln Gln His Arg Pro Ser Pro
125                 130                 135                 140

Glu Leu Thr Leu Ser Gln Lys Ile Arg Thr Lys Leu Gln Asn Ser Asp
                145                 150                 155

Leu Leu Glu Leu Cys His Ser Val Pro Lys Glu Val Val Gln Leu Gly
            160                 165                 170

Glu Ala Lys Asp Gly Ser Asn Leu Cys Phe Ser Lys
        175                 180



<210> 171 
<211> 350 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -68..-1

<400> 171

Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
            -65                 -60                 -55

Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
        -50                 -45                 -40

Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
    -35                 -30                 -25

Ala Ser Ala Arg Gly Lys Glu Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20                 -15                 -10                 -5

Gly Ala Gln Pro Gln Gln Glu Pro Leu Ala Leu Val Phe Arg Phe Gly
                1               5                   10

Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro Arg His
        15                  20                  25

Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
    30                  35                  40

Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45                  50                  55                  60

Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
                65                  70                  75

Arg Leu Lys Ile Pro Pro Phe Glu Lys Ala Arg Ser Val Leu Glu Ala
            80                  85                  90

Leu Gln Gln His Arg Pro Ser Pro Glu Leu Thr Leu Ser Gln Lys Ile
        95                  100                 105

Arg Thr Lys Leu Gln Asn Pro Asp Leu Leu Glu Leu Cys His Ser Val
    110                 115                 120

Pro Lys Glu Val Asp Gln Leu Gly Gly Arg Gly Tyr Gly Ser Glu Ser
125                 130                 135                 140

Gly Glu Glu Asp Phe Ala Ala Phe Arg Ala Trp Leu Arg Cys Tyr Gly
                145                 150                 155

Met Pro Gly Met Ser Ser Leu Gln Asp Arg His Gly Arg Thr Ile Trp
            160                 165                 170

Phe Gln Gly Asp Pro Gly Pro Leu Ala Pro Lys Gly Arg Lys Ser Arg
        175                 180                 185

Lys Lys Lys Ser Lys Ala Thr Gln Leu Ser Pro Glu Asp Arg Val Glu
    190                 195                 200

Asp Ala Leu Pro Pro Ser Lys Ala Pro Ser Lys Thr Arg Arg Ala Lys
205                 210                 215                 220

Arg Asp Leu Pro Lys Arg Thr Ala Thr Gln Arg Pro Glu Gly Thr Ser
                225                 230                 235

Leu Gln Gln Asp Pro Glu Ala Pro Thr Val Pro Lys Lys Gly Arg Arg
            240                 245                 250

Lys Gly Arg Gln Ala Ala Ser Gly His Cys Arg Pro Arg Lys Val Lys
        255                 260                 265

Ala Asp Ile Pro Ser Leu Glu Pro Glu Gly Thr Ser Ala Ser
    270                 275                 280



<210> 172 
<211> 390 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -68..-1

<400> 172

Met Pro Glu Gly Pro Glu Leu His Leu Ala Ser Gln Phe Val Asn Glu
            -65                 -60                 -55

Ala Cys Arg Ala Leu Val Phe Gly Gly Cys Val Glu Lys Ser Ser Val
        -50                 -45                 -40

Ser Arg Asn Pro Glu Val Pro Phe Glu Ser Ser Ala Tyr Arg Ile Ser
    -35                 -30                 -25

Ala Ser Ala Arg Gly Lys Glu Leu Arg Leu Ile Leu Ser Pro Leu Pro
-20                 -15                 -10                 -5

Gly Ala Gln Pro Gln Gln Glu Pro Leu Ala Leu Val Phe Arg Phe Gly
                1               5                   10

Met Ser Gly Ser Phe Gln Leu Val Pro Arg Glu Glu Leu Pro Arg His
        15                  20                  25

Ala His Leu Arg Phe Tyr Thr Ala Pro Pro Gly Pro Arg Leu Ala Leu
    30                  35                  40

Cys Phe Val Asp Ile Arg Arg Phe Gly Arg Trp Asp Leu Gly Gly Lys
45                  50                  55                  60

Trp Gln Pro Gly Arg Gly Pro Cys Val Leu Gln Glu Tyr Gln Gln Phe
                65                  70                  75

Arg Glu Asn Val Leu Arg Asn Leu Ala Asp Lys Ala Phe Asp Arg Pro
            80                  85                  90

Ile Cys Glu Ala Leu Leu Asp Gln Arg Phe Phe Asn Gly Ile Gly Asn
        95                  100                 105

Tyr Leu Arg Ala Glu Ile Leu Tyr Arg Leu Lys Ile Pro Pro Phe Glu
    110                 115                 120

Lys Ala Arg Ser Val Leu Glu Ala Leu Gln Gln His Arg Pro Ser Pro
125                 130                 135                 140

Glu Leu Thr Leu Ser Gln Lys Ile Arg Thr Lys Leu Gln Asn Pro Asp
                145                 150                 155

Leu Leu Glu Leu Cys His Ser Val Pro Lys Glu Val Val Gln Leu Gly
            160                 165                 170

Gly Arg Gly Tyr Gly Ser Glu Ser Gly Glu Glu Asp Phe Ala Ala Phe
        175                 180                 185

Arg Ala Trp Leu Arg Cys Tyr Gly Met Pro Gly Met Ser Ser Leu Gln
    190                 195                 200

Asp Arg His Gly Arg Thr Ile Trp Phe Gln Gly Asp Pro Gly Pro Leu
205                 210                 215                 220

Ala Pro Lys Gly Arg Lys Ser Arg Lys Lys Lys Ser Lys Ala Thr Gln
                225                 230                 235

Leu Ser Pro Glu Asp Arg Val Glu Asp Ala Leu Pro Pro Ser Lys Ala
            240                 245                 250

Pro Ser Arg Thr Arg Arg Ala Lys Arg Asp Leu Pro Lys Arg Thr Ala
        255                 260                 265

Thr Gln Arg Pro Glu Gly Thr Ser Leu Gln Gln Asp Pro Glu Ala Pro
    270                 275                 280

Thr Val Pro Lys Lys Gly Arg Arg Lys Gly Arg Gln Ala Ala Ser Gly
285                 290                 295                 300

His Cys Arg Pro Arg Lys Val Lys Ala Asp Ile Pro Ser Leu Glu Pro
                305                 310                 315

Glu Gly Thr Ser Ala Ser
            320



<210> 173 
<211> 190 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -82..-1

<400> 173

Met Tyr Val Trp Pro Cys Ala Val Val Leu Ala Gln Tyr Leu Trp Phe
        -80                 -75                 -70

His Arg Arg Ser Leu Pro Gly Lys Ala Ile Leu Glu Ile Gly Ala Gly
    -65                 -60                 -55

Val Ser Leu Pro Gly Ile Leu Thr Ala Lys Cys Gly Ala Glu Val Ile
-50                 -45                 -40                 -35

Leu Ser Asp Ser Ser Glu Leu Pro His Cys Leu Glu Val Cys Arg Gln
                -30                 -25                 -20

Ser Cys Gln Met Asn Asn Leu Pro His Leu Gln Val Val Gly Leu Thr
            -15                 -10                 -5

Trp Gly His Ile Ser Trp Asp Leu Leu Ala Leu Pro Pro Gln Asp Ile
        1               5                   10

Ile Leu Ala Ser Asp Val Phe Phe Glu Pro Glu Asp Phe Glu Asp Ile
15                  20                  25                  30

Leu Ala Thr Ile Tyr Phe Leu Met His Lys Asn Pro Lys Val Gln Leu
                35                  40                  45

Trp Ser Thr Tyr Gln Val Arg Ser Ala Asp Trp Ser Leu Glu Ala Leu
            50                  55                  60

Leu Tyr Lys Trp Asp Met Lys Cys Val His Ile Pro Leu Glu Ser Phe
        65                  70                  75

Asp Ala Asp Lys Glu Asp Ile Ala Glu Ser Thr Leu Pro Gly Arg His
    80                  85                  90

Thr Val Glu Met Leu Val Ile Ser Phe Ala Lys Asp Ser Leu
95                  100                 105



<210> 174
<211> 285
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -232..-1

<400> 174

Met Gly Cys Val Phe Gln Ser Thr Glu Asp Lys Arg Ile Phe Lys Ile
        -230                -225                -220

Asp Trp Thr Leu Ser Pro Gly Glu His Ala Lys Asp Glu Tyr Val Leu
    -215                -210                -205

Tyr Tyr Tyr Ser Asn Leu Ser Val Pro Ile Gly Arg Phe Gln Asn Arg
-200                -195                -190                -185

Val His Leu Met Gly Asp Asn Leu Cys Asn Asp Gly Ser Leu Leu Leu
                -180                -175                -170

Gln Asp Val Gln Glu Ala Asp Gln Gly Thr Tyr Ile Cys Glu Ile Arg
            -165                -160                -155

Leu Lys Gly Glu Ser Gln Val Phe Lys Lys Ala Val Val Leu His Val
        -150                -145                -140

Leu Pro Glu Glu Pro Lys Glu Leu Met Val His Val Gly Gly Leu Ile
    -135                -130                -125

Gln Met Gly Cys Val Phe Gln Ser Thr Glu Val Lys His Val Thr Lys
-120                -115                -110                -105

Val Glu Trp Ile Phe Ser Gly Arg Arg Ala Lys Glu Glu Ile Val Phe
                -100                -95                 -90

Arg Tyr Tyr His Lys Leu Arg Met Ser Ala Glu Tyr Ser Gln Ser Trp
            -85                 -80                 -75

Gly His Phe Gln Asn Arg Val Asn Leu Val Gly Asp Ile Phe Arg Asn
        -70                 -65                 -60

Asp Gly Ser Ile Met Leu Gln Gly Val Arg Glu Ser Asp Gly Gly Asn
    -55                 -50                 -45

Tyr Thr Cys Ser Ile His Leu Gly Asn Leu Val Phe Lys Lys Thr Ile
-40                 -35                 -30                 -25

Val Leu His Val Ser Pro Glu Glu Pro Arg Thr Leu Val Thr Pro Ala
                -20                 -15                 -10

Ala Leu Arg Pro Leu Val Leu Gly Gly Asn Gln Leu Val Ile Ile Val
            -5                  1               5

Gly Ile Val Cys Ala Thr Ile Leu Leu Leu Pro Val Leu Ile Leu Ile
    10                  15                  20

Val Lys Lys Thr Cys Gly Asn Lys Ser Ser Val Asn Ser Thr Val Leu
25                  30                  35                  40

Val Lys Asn Thr Lys Lys Thr Asn Pro Lys Lys Lys Lys
                45                  50



<210> 175 

<211> 153 
<212> PRT
<213> Homo sapiens

<400> 175

Met Gly Cys Val Phe Gln Ser Thr Val Asp Lys Cys Ile Phe Lys Ile
1               5                   10                  15

Asp Trp Thr Leu Ser Pro Gly Glu His Ala Lys Asp Glu Tyr Val Leu
            20                  25                  30

Tyr Tyr Tyr Ser Asn Leu Ser Val Pro Ile Gly Arg Phe Gln Asn Arg
        35                  40                  45

Val His Leu Met Gly Asp Ile Leu Cys Asn Asp Gly Ser Leu Leu Leu
    50                  55                  60

Gln Asp Val Gln Glu Ala Asp Gln Gly Thr Tyr Ile Cys Glu Ile Arg
65                  70                  75                  80

Leu Lys Gly Glu Ser Gln Val Phe Lys Lys Ala Val Val Leu His Val
                85                  90                  95

Leu Pro Glu Glu Pro Lys Glu Leu Met Val His Val Gly Gly Leu Ile
            100                 105                 110

Gln Met Gly Cys Val Phe Gln Ser Thr Glu Val Lys His Val Thr Lys
        115                 120                 125

Val Glu Trp Ile Phe Ser Gly Arg Arg Ala Lys Val Thr Arg Arg Lys
    130                 135                 140

His His Cys Val Arg Glu Gly Ser Gly
145                 150



<210> 176 

<211> 49 

<212> PRT 

<213> Homo sapiens 



<400> 176 

Met Leu Xaa Gly Asp His Arg Ala Leu Leu Leu Lys Ile Trp Leu Leu
1               5                   10                  15

Gln Arg Pro Glu Ser Gln Glu Gly Leu Leu Pro Gly Arg Leu Val Val
            20                  25                  30

Met Glu Arg Arg Val Lys Met Thr Ser Cys Pro Ser Cys Pro Arg Phe
        35                  40                  45

Cys



<210> 177 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24..-1

<400> 177

Met Lys Ser Ala Lys Leu Gly Phe Leu Leu Arg Phe Phe Ile Phe Cys
                -20                 -15                 -10

Ser Leu Asn Thr Leu Leu Leu Gly Gly Val Asn Lys Ile Ala Glu Lys
            -5                  1               5

Ile Cys Gly Asp Leu Lys Asp Pro Cys Lys Leu Asp Met Asn Phe Gly
    10                  15                  20

Ser Cys Tyr Glu Val His Phe Arg Tyr Phe Tyr Asn Arg Thr Ser Lys
25                  30                  35                  40

Arg Cys Glu Thr Phe Val Phe Ser Gly Cys Asn Gly Asn Leu Asn Asn
                45                  50                  55

Phe Lys Leu Lys Ile Glu Arg Glu Val Ala Cys Val Ala Lys Tyr Lys
            60                  65                  70

Pro Pro Arg
        75



<210> 178
<211> 95
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -37..-1

<400> 178

Met Ala Ser Pro Ala Val Asn Arg Trp Lys Arg Pro Arg Leu Lys Pro
        -35                 -30                 -25

Val Trp Pro Arg Arg Leu Glu Ser Trp Leu Leu Leu Asp Ala Leu Leu
    -20                 -15                 -10

Arg Leu Gly Asp Thr Lys Lys Lys Arg Gln Pro Glu Ala Ala Thr Lys
-5                  1               5                   10

Ser Cys Val Arg Ser Ser Cys Gly Gly Pro Ser Gly Asp Gly Pro Pro
            15                  20                  25

Pro Cys Leu Gln Gln Pro Asp Pro Arg Ala Leu Ser Gln Ala Phe Ser
        30                  35                  40

Arg Ser Phe Pro Leu Phe Pro Ser Leu Ala Gly Lys Ser Met Ile
    45                  50                  55






<210> 179 

<211> 121 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -23..-1

<400> 179

Met Met Leu Pro Gln Trp Leu Leu Leu Leu Phe Leu Leu Phe Phe Phe
            -20                 -15                 -10

Leu Phe Leu Leu Thr Arg Gly Ser Leu Ser Pro Thr Lys Tyr Asn Leu
        -5                  1               5

Leu Glu Leu Lys Glu Ser Cys Ile Arg Asn Gln Asp Cys Glu Thr Gly
10                  15                  20                  25

Cys Cys Gln Arg Ala Pro Asp Asn Cys Glu Ser His Cys Ala Glu Lys
                30                  35                  40

Gly Ser Glu Gly Ser Leu Cys Gln Thr Gln Val Phe Phe Gly Gln Tyr
            45                  50                  55

Arg Ala Cys Pro Cys Leu Arg Asn Leu Thr Cys Ile Tyr Ser Lys Asn
        60                  65                  70

Glu Lys Trp Leu Ser Ile Ala Tyr Gly Arg Cys Gln Lys Ile Gly Arg
    75                  80                  85

Gln Lys Leu Ala Lys Lys Met Phe Phe
90                  95



<210> 180
<211> 59
<212> PRT
<213> Homo sapiens

<400> 180

Met Ile Leu Cys Phe Leu Leu Pro His His Arg Leu Gln Glu Ala Arg
1               5                   10                  15

Gln Ile Gln Val Leu Lys Met Leu Pro Arg Glu Lys Leu Arg Arg Arg
            20                  25                  30

Glu Glu Arg Lys Gln Ile Asn Gly Lys Lys Glu Arg Thr Lys Tyr Glu
        35                  40                  45

Thr Pro Arg Lys Arg Glu Gly Lys Lys Lys Lys
    50                  55



<210> 181 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14..-1

<400> 181

Met Val Ala Leu Asn Leu Ile Leu Val Pro Cys Cys Ala Ala Trp Cys
                -10                 -5                  1

Asp Pro Arg Arg Ile His Ser Gln Asp Asp Val Pro Arg Ser Ser Ala
        5                   10                  15

Ala Asp Thr Gly Ser Ala Met Gln Arg Arg Glu Ala Trp Ala Gly Trp
    20                  25                  30

Arg Arg Ser Gln Pro Phe Ser Val Gly Leu Pro Ser Ala Glu Arg Leu
35                  40                  45                  50

Glu Asn Gln Pro Gly Lys Leu Ser Trp Arg Ser Leu Val Gly Glu Gly
                55                  60                  65

Tyr Arg Ile Cys Asp Leu
            70



<210> 182 
<211> 165 
<212> PRT 
<213> Homo sapiens 

<220>
<221> SIGNAL
<222> -58..-1

<400> 182

Met Thr Arg Leu Cys Leu Pro Arg Pro Glu Ala Arg Glu Asp Pro Ile
            -55                 -50                 -45

Pro Val Pro Pro Arg Gly Leu Gly Ala Gly Glu Gly Ser Gly Ser Pro
        -40                 -35                 -30

Val Arg Pro Pro Val Ser Thr Trp Gly Pro Ser Trp Ala Gln Leu Leu
    -25                 -20                 -15

Asp Ser Val Leu Trp Leu Gly Ala Leu Gly Leu Thr Ile Gln Ala Val
-10                 -5                  1               5

Phe Ser Thr Thr Gly Pro Ala Leu Leu Leu Leu Leu Val Ser Phe Leu
            10                  15                  20

Thr Phe Asp Leu Leu His Arg Pro Ala Gly His Thr Leu Pro Gln Arg
        25                  30                  35

Lys Leu Leu Thr Arg Gly Gln Ser Gln Gly Ala Gly Glu Gly Pro Gly
    40                  45                  50

Gln Gln Glu Ala Leu Leu Leu Gln Met Gly Thr Val Ser Gly Gln Leu
55                  60                  65                  70

Ser Leu Gln Asp Ala Leu Leu Leu Leu Leu Met Gly Leu Gly Pro Leu
                75                  80                  85

Leu Arg Ala Cys Gly Met Pro Leu Thr Leu Leu Gly Leu Ala Phe Cys
            90                  95                  100

Leu His Pro Trp Ala
        105



<210> 183 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220>
<221> SIGNAL
<222> -35..-1

<400> 183

Met Pro Phe Gln Phe Gly Thr Gln Pro Arg Arg Phe Pro Val Glu Gly
-35                 -30                 -25                 -20

Gly Asp Ser Ser Ile Glu Leu Glu Pro Gly Leu Ser Ser Ser Ala Ala
                -15                 -10                 -5

Cys Asn Gly Lys Glu Met Ser Pro Thr Arg Gln Leu Arg Arg Cys Pro
            1               5                   10

Gly Ser His Cys Leu Thr Ile Thr Asp Val Pro Val Thr Val Tyr Ala
    15                  20                  25

Thr Thr Arg Lys Pro Pro Ala Gln Ser Ser Lys Glu Met His Pro Lys
30                  35                  40                  45



<210> 184 
<211> 73 
<212> PRT 
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -21..-1

<400> 184

Met Ala Pro Gln Thr Leu Leu Pro Val Leu Val Leu Cys Val Leu Leu
    -20                 -15                 -10

Leu Gln Ala Gln Gly Gly Tyr Arg Asp Lys Met Arg Met Gln Arg Ile
-5                  1               5                   10

,fys Val Cys Glu Lys Arg Pro Ser He Asp Leu Cys He His His Cys 

15 20 25 

Ser Cys Phe Gln Lys Cys Glu Thr Asn Lys Ile Cys Cys Ser Ala Phe
        30                  35                  40

|||:ys Gly Asn He Cys Met Ser He Leu 

    45                  50



<210> 185 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<400> 185 

Met Leu Gly Ala Glu Thr Glu Glu Lys Leu Phe Asp Ala Pro Leu Ser
1               5                   10                  15

Ile Ser Lys Arg Glu Gln Leu Glu Gln Gln Val Pro Glu Asn Tyr Phe
            20                  25                  30

Tyr Val Pro Asp Leu Gly Gln Val Pro Glu Ile Asp Val Pro Ser Tyr
        35                  40                  45

Leu Pro Asp Leu Pro Gly Ile Ala Asn Asp Leu Met Tyr Ile Ala Asp
    50                  55                  60

Leu Gly Pro Gly Ile Ala Pro Ser Ala Pro Gly Thr Ile Pro Glu Leu
65                  70                  75                  80

Pro Thr Phe His Thr Glu Val Ala Glu Pro Leu Lys Thr Tyr Lys Met
                85                  90                  95

Gly Tyr



<210> 186 
<211> 112 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21..-1

<400> 186

Met Glu Ser Arg Val Leu Leu Arg Thr Phe Cys Leu Ile Phe Gly Leu
    -20                 -15                 -10

Gly Ala Val Trp Gly Leu Gly Val Asp Pro Ser Leu Gln Ile Asp Val
-5                  1               5                   10

Leu Thr Glu Leu Glu Leu Gly Glu Ser Thr Thr Gly Val Arg Gln Val
            15                  20                  25

Pro Gly Leu His Asn Gly Thr Lys Ala Phe Leu Phe Gln Asp Thr Pro
        30                  35                  40

Arg Ser Ile Lys Ala Ser Thr Ala Thr Ala Glu Gln Phe Phe Gln Lys
    45                  50                  55

Leu Arg Asn Lys His Glu Phe Thr Ile Leu Val Thr Leu Lys Gln Thr
60                  65                  70                  75

His Leu Asn Ser Gly Val Ile Leu Ser Ile His His Leu Asp His Arg
                80                  85                  90



<210> 187
<211> 70
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -44..-1

<400> 187

Met Cys Cys Tyr Cys Arg Ile Phe Cys Leu Arg Cys Thr Tyr Phe Pro
                -40                 -35                 -30

Val His Cys Gly Met Cys Asn Leu Arg Tyr Phe Glu Phe Ser Thr Phe
            -25                 -20                 -15

Leu Leu Ser Leu Ser Leu Ile Thr Tyr Cys Phe Trp Asp Pro Pro His
        -10                 -5                  1

Arg Gly Ser His Ser Leu Ser Leu Glu His Thr Pro Leu Asp Phe Leu
5                   10                  15                  20

Glu Trp Gly Leu Leu Arg
                25



<210> 188 
<211> 92
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -13..-1

<400> 188

Met Leu Phe Ser Leu Ser Leu Leu Ser Asn Leu Asn Gln Ile Gly Ser
            -10                 -5                  1

Ser His Leu Asp Arg Pro His Ile Pro Gly Gln Ser Ala Gln Leu Phe
    5                   10                  15

Ile Tyr Gln Met Ser Ser Gln Gln Leu Gln Gln Gln Pro Ser Ala Asn
20                  25                  30                  35

Lys Lys Ala Gly Lys Ile His Asn Thr Pro Phe Ala Asn Gln Leu Asn
                40                  45                  50

Pro Thr Gln His Leu Ala Lys Pro Phe Gln Gln Ile Leu Pro Gly Arg
            55                  60                  65

Gln Ser Gly Ser Leu Thr Ser Pro Phe Leu Ala Cys
        70                  75







<210> 189
<211> 207
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -42..-1

<400> 189

Met His Ile Leu Gln Leu Leu Thr Thr Val Asp Asp Gly Ile Gln Ala
        -40                 -35                 -30

Ile Val His Cys Pro Asp Thr Gly Lys Asp Ile Trp Asn Leu Leu Phe
    -25                 -20                 -15

Asp Leu Val Cys His Glu Phe Cys Gln Ser Asp Asp Pro Pro Ile Ile
-10                 -5                  1               5

Leu Gln Glu Gln Lys Thr Val Leu Ala Ser Val Phe Ser Val Leu Ser
            10                  15                  20

Ala Ile Tyr Ala Ser Gln Thr Glu Gln Glu Tyr Leu Lys Ile Glu Lys
        25                  30                  35

Val Asp Leu Pro Leu Ile Asp Ser Leu Ile Arg Val Leu Gln Asn Met
    40                  45                  50

Glu Gln Cys Gln Lys Lys Pro Glu Asn Ser Ala Glu Ser Asn Thr Glu
55                  60                  65                  70

Glu Thr Lys Arg Thr Asp Leu Thr Gln Asp Asp Leu His Leu Lys Ile
                75                  80                  85

Leu Lys Asp Ile Leu Cys Glu Phe Leu Ser Asn Ile Phe Gln Ala Leu
            90                  95                  100

Thr Lys Glu Thr Val Ala Gln Gly Val Lys Glu Gly Gln Leu Ser Lys
        105                 110                 115

Gln Lys Cys Ser Ser Ala Phe Gln Asn Leu Leu Pro Phe Tyr Ser Pro
    120                 125                 130

Val Val Glu Asp Phe Ile Lys Ile Leu Arg Glu Val Asp Lys Ala Leu
135                 140                 145                 150

Ala Asp Asp Leu Glu Lys Asn Phe Pro Ser Leu Lys Val Gln Thr
                155                 160                 165



<210> 190 

<211> 201 

<212> PRT 

<213> Homo sapiens 



<400> 190

Met Gln Val Ala Leu Lys Glu Asp Leu Asp Ala Leu Lys Glu Lys Phe
1               5                   10                  15

Arg Thr Met Glu Ser Asn Gln Lys Ser Ser Phe Gln Glu Ile Pro Lys
            20                  25                  30

Leu Asn Glu Glu Leu Leu Ser Lys Gln Lys Gln Leu Glu Lys Ile Glu
        35                  40                  45

Ser Gly Glu Met Gly Leu Asn Lys Val Trp Ile Asn Ile Thr Glu Met
    50                  55                  60

Asn Lys Gln Ile Ser Leu Leu Thr Ser Ala Val Asn His Leu Lys Ala
65                  70                  75                  80

Asn Val Lys Ser Ala Ala Asp Leu Ile Ser Leu Pro Thr Thr Val Glu
                85                  90                  95

Gly Leu Gln Lys Ser Val Ala Ser Ile Gly Asn Thr Leu Asn Ser Val
            100                 105                 110

His Leu Ala Val Glu Ala Leu Gln Lys Thr Val Asp Glu His Lys Lys
        115                 120                 125

Thr Met Glu Leu Leu Gln Ser Asp Met Asn Gln His Phe Leu Lys Glu
    130                 135                 140

Thr Pro Gly Ser Asn Gln Ile Ile Pro Ser Pro Ser Ala Thr Ser Glu
145                 150                 155                 160

Leu Asp Asn Lys Thr His Ser Glu Asn Leu Lys Gln Met Gly Asp Arg
                165                 170                 175

Ser Ala Thr Leu Lys Arg Gln Ser Leu Asp Gln Val Thr Asn Arg Thr
            180                 185                 190

Asp Thr Val Lys Ile Gln Lys Lys Lys
        195                 200











<210> 191 
<211> 379 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -37..-1

<400> 191

Met Pro His Ser Ser Leu His Pro Ser Ile Pro Cys Pro Arg Gly His
        -35                 -30                 -25

Gly Ala Gln Lys Ala Ala Leu Val Leu Leu Ser Ala Cys Leu Val Thr
    -20                 -15                 -10

Leu Trp Gly Leu Gly Glu Pro Pro Glu His Thr Leu Arg Tyr Leu Val
-5                  1               5                   10

Leu His Leu Ala Ser Leu Gln Leu Gly Leu Leu Leu Asn Gly Val Cys
            15                  20                  25

Ser Leu Ala Glu Glu Leu Arg His Ile His Ser Arg Tyr Arg Gly Ser
        30                  35                  40

Tyr Trp Arg Thr Val Arg Ala Cys Leu Gly Cys Pro Leu Arg Arg Gly
    45                  50                  55

Ala Leu Leu Leu Leu Ser Ile Tyr Phe Tyr Tyr Ser Leu Pro Asn Ala
60                  65                  70                  75

Val Gly Pro Pro Phe Thr Trp Met Leu Ala Leu Leu Gly Leu Ser Gln
                80                  85                  90

Ala Leu Asn Ile Leu Leu Gly Leu Lys Gly Leu Ala Pro Ala Glu Ile
            95                  100                 105

Ser Ala Val Cys Glu Lys Gly Asn Phe Asn Val Ala His Gly Leu Ala
        110                 115                 120

Trp Ser Tyr Tyr Ile Gly Tyr Leu Arg Leu Ile Leu Pro Glu Leu Gln
    125                 130                 135

Ala Arg Ile Arg Thr Tyr Asn Gln His Tyr Asn Asn Leu Leu Arg Gly
140                 145                 150                 155

Ala Val Ser Gln Arg Leu Tyr Ile Leu Leu Pro Leu Asp Cys Gly Val
                160                 165                 170

Pro Asp Asn Leu Ser Met Ala Asp Pro Asn Ile Arg Phe Leu Asp Lys
            175                 180                 185

Leu Pro Gln Gln Thr Gly Asp Arg Ala Gly Ile Lys Asp Arg Val Tyr
        190                 195                 200

Ser Asn Ser Ile Tyr Glu Leu Leu Glu Asn Gly Gln Arg Ala Gly Thr
    205                 210                 215

Cys Val Leu Glu Tyr Ala Thr Pro Leu Gln Thr Leu Phe Ala Met Ser
220                 225                 230                 235

Gln Tyr Ser Gln Ala Gly Phe Ser Arg Glu Asp Arg Leu Glu Gln Ala
                240                 245                 250

Lys Leu Phe Cys Arg Thr Leu Glu Asp Ile Leu Ala Asp Ala Pro Glu
            255                 260                 265

Ser Gln Asn Asn Cys Arg Leu Ile Ala Tyr Gln Glu Pro Ala Asp Asp
        270                 275                 280

Ser Ser Phe Ser Leu Ser Gln Glu Val Leu Arg His Leu Arg Gln Glu
    285                 290                 295

Glu Lys Glu Glu Val Thr Val Gly Ser Leu Lys Thr Ser Ala Val Pro
300                 305                 310                 315

Ser Thr Ser Thr Met Ser Gln Glu Pro Glu Leu Leu Leu Ser Gly Met
                320                 325                 330

Gly Lys Pro Leu Pro Leu Arg Thr Asp Phe Ser
            335                 340



<210> 192 
<211> 112 
<212> PRT 

<213> Homo sapiens 
<400> 192 

Met Pro Ser Glu Gly Arg Cys Trp Glu Thr Leu Lys Ala Leu Arg Ser
1               5                   10                  15

Ser Asp Lys Gly Arg Leu Cys Tyr Tyr Arg Asp Trp Leu Leu Arg Arg
            20                  25                  30

Glu Asp Val Leu Glu Glu Cys Met Ser Leu Pro Lys Leu Ser Ser Tyr
        35                  40                  45

Ser Gly Trp Val Val Glu His Val Leu Pro His Met Gln Glu Asn Gln
    50                  55                  60

Pro Leu Ser Glu Thr Ser Pro Ser Ser Thr Ser Ala Ser Ala Leu Asp
65                  70                  75                  80

Gln Pro Ser Phe Val Pro Lys Ser Pro Asp Ala Ser Ser Ala Phe Ser
                85                  90                  95

Pro Ala Ser Pro Ala Thr Pro Asn Gly Thr Lys Gly Lys Lys Lys Lys
            100                 105                 110



<210> 193 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 193 

Ser Leu Pro Gln Ala Leu Trp Phe Gln Phe Phe Tyr His Ser Gly Ser
1               5                   10                  15

Ser Leu Glu Ser Pro Gly Met Leu Asn Gly Pro Phe Gln His Arg Asn
            20                  25                  30

Ser Arg Ile Met Thr His Arg Ser Ala Glu Lys
        35                  40



<210> 194
<211> 51
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -16..-1

<400> 194

Met Leu Arg Ile Ala Leu Thr Leu Ile Pro Ser Met Leu Ser Arg Ala
    -15                 -10                 -5

Ala Gly Trp Cys Trp Tyr Lys Glu Pro Thr Gln Gln Phe Ser Tyr Leu
1               5                   10                  15

Cys Leu Pro Cys Leu Ser Trp Asn Lys Lys Gly Asn Val Leu Gln Leu
            20                  25                  30

Pro Asn Phe
        35



<210> 195 
<211> 244 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18..-1

<400> 195

Met Ala Asn Pro Lys Leu Leu Gly Leu Glu Leu Ser Glu Ala Glu Ala
            -15                 -10                 -5

Ile Gly Ala Asp Ser Ala Arg Phe Glu Glu Leu Leu Leu Gln Ala Ser
        1               5                   10

Lys Glu Leu Gln Gln Ala Gln Thr Thr Arg Pro Glu Ser Thr Gln Ile
15                  20                  25                  30

Gln Pro Gln Pro Gly Phe Cys Ile Lys Thr Asn Ser Ser Glu Gly Lys
                35                  40                  45

Val Phe Ile Asn Ile Cys His Ser Pro Ser Ile Pro Pro Pro Ala Asp
            50                  55                  60

Val Thr Glu Glu Glu Leu Leu Gln Met Leu Glu Glu Asp Gln Ala Gly
        65                  70                  75

Phe Arg Ile Pro Met Ser Leu Gly Glu Pro His Ala Glu Leu Asp Ala
    80                  85                  90

Lys Gly Gln Gly Cys Thr Ala Tyr Asp Val Ala Val Asn Ser Asp Phe
95                  100                 105                 110

Tyr Arg Arg Met Gln Asn Ser Asp Phe Leu Arg Glu Leu Val Ile Thr
                115                 120                 125

Ile Ala Arg Glu Gly Leu Glu Asp Ile Tyr Asn Leu Gln Leu Asn Pro
            130                 135                 140

Glu Trp Arg Met Met Lys Asn Arg Pro Phe Met Gly Ser Ile Ser Gln
        145                 150                 155

Gln Asn Ile Arg Ser Glu Gln Arg Pro Arg Ile Gln Glu Leu Gly Asp
    160                 165                 170

Leu Tyr Thr Pro Ala Pro Gly Arg Ala Glu Ser Gly Pro Glu Lys Pro
175                 180                 185                 190

His Leu Asn Leu Trp Leu Glu Ala Pro Asp Leu Leu Leu Ala Glu Val
                195                 200                 205

Asp Leu Pro Lys Leu Asp Gly Ala Leu Gly Leu Ser Leu Glu Ile Gly
            210                 215                 220

Arg Thr Ala Trp
        225



<210> 196
<211> 353
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -34..-1

<400> 196

Met Glu Arg Gly Leu Lys Ser Ala Asp Pro Arg Asp Gly Thr Gly Tyr
                -30                 -25                 -20

Thr Gly Trp Ala Gly Ile Ala Val Leu Tyr Leu His Leu Tyr Asp Val
            -15                 -10                 -5

Phe Gly Asp Pro Ala Tyr Leu Gln Leu Ala His Gly Tyr Val Lys Gln
        1               5                   10

Ser Leu Asn Cys Leu Thr Lys Arg Ser Ile Thr Phe Leu Cys Gly Asp
15                  20                  25                  30

Ala Gly Pro Leu Ala Val Ala Ala Val Leu Tyr His Lys Met Asn Asn
                35                  40                  45

Glu Lys Gln Ala Glu Asp Cys Ile Thr Arg Leu Ile His Leu Asn Lys
            50                  55                  60






He 


ASD 


Pro 


His 


Ala 


Pro 




xitsu. ^ y J- vjxy i^xy 


i.xe 


trxy iyr 






65 








7 0 


/ D 






He 


Tvr 
J. y ±. 


Ala 


Leu 


Leu 


Phe 


V jT^oii J_iy o 


r\oLl iriXC Vjxy Va.X 


oJ.U 


j-iys inr 




80 










85 


^ U 






Pro 


Gin 


Ser 


His 


He 


Gin 


Gin Tie Pv=i 


VJXU. XlIX XXC^ XICU. 


Thr 


Ser Gly 


95 










100 




X uo 




110 


Glu 


Asn 


Leu 


Ala 


Arg Lys 


Avo" AcjTi "Dlno 


iiix i^xo. ijys oer 


Pro 


Leu Met 










115 






Ton 
xz u 




125 


Tvr 


Glu 


Trn 


Tvr 


Gin 


Glu 


± y 1. xyi vdx 


Gly Ala Ala His 


Gly 


Leu Ala 








13 0 






lie: 




140 




Gly 


He 


Tvr 


Tvr 


Tyr 


Leu 


Mot" m n Pr*ri 


Ser Leu Gin Val 


Ser 


















155 




Lvs 


Leu 


His 


Ser 


Leu 


Val 


liVC* PtO *^*=iT* 


Val Asp Tyr Val 


Cys 


Gin Leu 




160 












170 






Lvs 


Phe 


Pro 


Ser 


Gly Asn 


TvT Pyn Ptti 

•L y X IrXW f X^ 


Cys He Gly Asp Asn 


Arg Asp 


175 










180 




185 




190 


Leu 


Leu 


Val 




Trp 


Cys 


iixs vjxy A±a 


Pro Gly Val He 


Tyr 


Met Leu 










195 






200 




205 


He 


Gin 


Ala 


TVT 


Lys 


Val 


irlifc: rixCJ vj_LU 


Glu Lys Tyr Leu 


Cys 


Asp Ala 








Z J. u 






215 




220 




Tvr 


Gin 


Cys 


Ala 


Asp 


Val 


xxc i X p 


Tyr Gly Leu Leu Lys 


Lys Gly 






Z ^ D 








230 


235 






m V 






His 


Gly 


ber Axa oxy 


Asn Ala Tyr Ala 


Phe 


Leu Thr 


Z^t U 










z4 b 


250 








lyr 






Thr 


Gin 


Asp Met Lys 


Tyr Leu Tyr Arg Ala 


Cys Lys 












260 




265 




270 


I^Phe 


Ala 


Glu 


Trp 


Cys 


Leu 


Glu Tyr Gly 


Glu His Gly Cys 


Arg 


Thr Pro 










275 






280 




285 


:|^SP 


Thr 


Pro 


Phe 


Ser 


Leu 


Phe Glu Gly 


Met Ala Gly Thr 


He 


Tyr Phe 


' Leu 






290 






295 




300 




Ala 


Asp 


Leu 


Leu 


Val 


Pro Thr Lys 


Ala Arg Phe Pro 


Ala 


Phe Glu 






305 








310 


315 



























<210> 197
<211> 30
<212> PRT
<213> Homo sapiens

<400> 197

Met Gln Met Asp Thr Phe Phe Met Ser Glu Lys His Thr His Thr His
1               5                   10                  15

Thr His Ile His Thr His Thr Arg Lys Thr Lys Lys Lys Lys
            20                  25                  30



<210> 198 
<211> 112 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -48..-1

<400> 198

Met Gln Asp Thr Gly Ser Val Val Pro Leu His Trp Phe Gly Phe Gly
-45 -40 -35

Tyr Ala Ala Leu Val Ala Ser Gly Gly Ile Ile Gly Tyr Val Lys Ala
-30 -25 -20

Gly Ser Val Pro Ser Leu Ala Ala Gly Leu Leu Phe Gly Ser Leu Ala
-15 -10 -5

Gly Leu Gly Ala Tyr Gln Leu Ser Gln Asp Pro Arg Asn Val Trp Val
1 5 10 15

Phe Leu Ala Thr Ser Gly Thr Leu Ala Gly Ile Met Gly Met Arg Phe
20 25 30

Tyr His Ser Gly Lys Phe Met Pro Ala Gly Leu Ile Ala Gly Ala Ser
35 40 45

Leu Leu Met Val Ala Lys Val Gly Val Ser Met Phe Asn Arg Pro His
50 55 60



<210> 199 
<211> 54 
<212> PRT 
<213> Homo sapiens

<400> 199

Glu Ile Ala Gly Tyr Gly Ala Glu Gly Phe Ser Ser Val Leu Gly Tyr
1 5 10 15

Pro Arg Trp His Arg Leu Pro Pro Gln Ser Leu Gln His His Gln Tyr
20 25 30

Lys Gln Arg Arg Trp Pro Asp Arg Arg Cys Leu Gln Ser His Thr Gln
35 40 45

Ser Ser Gly His Leu Pro



<210> 200
<211> 151
<212> PRT
<213> Homo sapiens

<220>

<221> SIGNAL
<222> -21..-1

<400> 200

Met Ala Ala Ser Thr Ser Met Xaa Pro Val Ala Val Thr Ala Ala Val
-20 -15 -10

Ala Pro Val Leu Ser Ile Asn Ser Asp Phe Ser Asp Leu Arg Glu Ile
-5 1 5 10

Lys Lys Gln Leu Leu Leu Ile Ala Gly Leu Thr Arg Glu Arg Gly Leu
15 20 25

Leu His Ser Ser Lys Trp Ser Ala Glu Leu Ala Phe Ser Leu Pro Ala
30 35 40

Leu Pro Xaa Gly Gln Leu Gln Pro Pro Pro Pro Ile Thr Glu Glu Asp
45 50 55

Ala Gln Asp Met Asp Ala Tyr Thr Leu Ala Lys Ala Tyr Phe Asp Val
60 65 70 75

Lys Glu Tyr Asp Arg Ala Ala His Phe Leu His Gly Cys Asn Ser Lys
80 85 90

Lys Ala Tyr Phe Leu Tyr Met Tyr Ser Arg Tyr Leu Val Arg Ala Ile
95 100 105

Leu Lys Cys His Ser Ala Phe Ser Glu Thr Ser Ile Phe Arg Thr Asn
110 115 120

Gly Lys Val Lys Ser Phe Lys
125 130



<210> 201 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25..-1

<400> 201

Met Ser Met Ala Val Glu Thr Phe Gly Phe Phe Met Ala Thr Val Gly
-25 -20 -15 -10

Leu Leu Met Leu Gly Val Thr Leu Pro Asn Ser Tyr Trp Arg Val Ser
-5 1 5

Thr Val His Gly Asn Val Ile Thr Thr Asn Thr Ile Phe Glu Asn Leu
10 15 20

Trp Phe Ser Cys Ala Thr Asp Ser Leu Gly Val Tyr Asn Cys Trp Glu
25 30 35

Phe Pro Ser Met Leu Ala Leu Ser Gly Tyr Ile Gln Ala Cys Arg Ala
40 45 50 55

Leu Met Ile Thr Ala Ile Leu Leu Gly Phe Leu Gly Leu Leu Leu Gly
60 65 70

Ile Ala Gly Leu Arg Cys Thr Asn Ile Gly Gly Leu Glu Leu Ser Arg
75 80 85

Lys Ala Lys Leu Ala Ala Thr Ala Gly Ala Pro His Ile Leu Ala Gly
90 95 100

Ile Cys Gly Met Val Ala Ile Ser Trp Tyr Ala Phe Asn Ile Thr Arg
105 110 115

Asp Phe Phe Asp Pro Leu Tyr Pro Gly Thr Lys Tyr Glu Leu Gly Pro
120 125 130 135

Ala Leu Tyr Leu Gly Trp Ser Ala Ser Leu Ile Ser Ile Leu Gly Gly
140 145 150

Leu Cys Leu Cys Ser Ala Cys Cys Cys Gly Ser Asp Glu Asp Pro Ala
155 160 165

Ala Ser Ala Arg Arg Pro Tyr Gln Ala Pro Val Ser Val Met Pro Val
170 175 180

Ala Thr Ser Asp Gln Glu Gly Asp Ser Ser Phe Gly Lys Tyr Gly Arg
185 190 195

Asn Ala Tyr Val
200



<210> 202
<211> 64
<212> PRT

<213> Homo sapiens
<220>

<221> SIGNAL
<222> -47..-1

<400> 202

Met His Gly Phe Glu Ile Ile Ser Leu Lys Glu Glu Ser Pro Leu Gly
-45 -40 -35

Lys Val Ser Gln Gly Pro Leu Phe Asn Val Thr Ser Gly Ser Ser Ser
-30 -25 -20

Pro Val Thr Trp Leu Gly Leu Leu Ser Phe Gln Asn Leu His Cys Phe
-15 -10 -5 1

Pro Asp Leu Pro Thr Glu Met Pro Leu Arg Ala Lys Gly Val Asn Thr
5 10 15



<210> 203

<211> 146

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -31..-1

<400> 203

Met Met Trp Gln Lys Tyr Ala Gly Ser Arg Arg Ser Met Pro Leu Gly
-30 -25 -20

Ala Arg Ile Leu Phe His Gly Val Phe Tyr Ala Gly Gly Phe Ala Ile
-15 -10 -5 1

Val Tyr Tyr Leu Ile Gln Lys Phe His Ser Arg Ala Leu Tyr Tyr Lys
5 10 15

Leu Ala Val Glu Gln Leu Gln Ser His Pro Glu Ala Gln Glu Ala Leu
20 25 30

Gly Pro Pro Leu Asn Ile His Tyr Leu Lys Leu Ile Asp Arg Glu Asn
35 40 45

Phe Val Asp Ile Val Asp Ala Lys Leu Lys Ile Pro Val Ser Gly Ser
50 55 60 65

Lys Ser Glu Gly Leu Leu Tyr Val His Ser Ser Arg Gly Gly Pro Phe
70 75 80

Gln Arg Trp His Leu Asp Glu Val Phe Leu Glu Leu Lys Asp Gly Gln
85 90 95

Gln Ile Pro Val Phe Lys Leu Ser Gly Glu Asn Gly Asp Glu Val Lys
100 105 110

Lys Glu
115



<210> 204 
<211> 87 
<212> PRT 

<213> Homo sapiens 






<400> 204 

Met Glu Leu Ala Pro Thr Ala Arg Leu Pro Pro Gly His Gly Ser Leu
1 5 10 15

Pro His Gly Val Leu Gly Pro Arg Ala Thr Gly Ser Val Thr His Leu
20 25 30

Ser Leu Leu Pro Gln Ile Lys Gln Arg Ala Ser Glu Ala Leu Pro Glu
35 40 45

Leu Leu Arg Pro Val Thr Pro Ile Thr Asn Phe Glu Gly Ser Gln Ser
50 55 60

Gln Asp His Ser Gly Ile Phe Gly Leu Val Thr Asn Leu Glu Glu Leu
65 70 75 80

Glu Val Asp Asp Trp Glu Phe
85



<210> 205 
<211> 40 
<212> PRT 

<213> Homo sapiens 

<220>

<221> SIGNAL
<222> -27..-1

<400> 205

Met Arg Thr Leu Phe Gly Ala Val Arg Ala Pro Phe Ser Ser Leu Thr
-25 -20 -15

Leu Leu Leu Ile Thr Pro Ser Pro Ser Pro Leu Leu Phe Asp Arg Gly
-10 -5 1 5

Leu Ser Leu Arg Ser Ala Met Ser
10



<210> 206

<211> 154

<212> PRT

<213> Homo sapiens

<400> 206

Met Gly Ser Leu Ser Gly Leu Arg Leu Ala Ala Gly Ser Cys Phe Arg
1 5 10 15

Leu Cys Glu Arg Asp Val Ser Ser Ser Leu Arg Leu Thr Arg Ser Ser
20 25 30

Asp Leu Lys Arg Ile Asn Gly Phe Cys Thr Lys Pro Gln Glu Ser Pro
35 40 45

Gly Ala Pro Ser Arg Thr Tyr Asn Arg Val Pro Leu His Lys Pro Thr
50 55 60

Asp Trp Gln Lys Lys Ile Leu Ile Trp Ser Gly Arg Phe Lys Lys Glu
65 70 75 80

Asp Glu Ile Pro Glu Thr Val Ser Leu Glu Met Leu Asp Ala Ala Lys
85 90 95

Asn Lys Met Arg Val Lys Ser Ser Tyr Leu Met Ile Ala Leu Thr Val
100 105 110

Val Gly Cys Ile Phe Met Val Ile Glu Gly Lys Lys Ala Ala Gln Arg
115 120 125

His Glu Thr Leu Thr Ser Leu Asn Leu Glu Lys Lys Ala Arg Leu Lys
130 135 140

Glu Glu Ala Ala Met Lys Ala Lys Thr Glu
145 150



<210> 207 

<211> 101 

<212> PRT 

<213> Homo sapiens 



<400> 207 

Met Val Cys Glu Lys Cys Glu Lys Lys Leu Gly Thr Val Ile Thr Pro
1 5 10 15

Asp Thr Trp Lys Asp Gly Ala Arg Asn Thr Thr Glu Ser Gly Gly Arg
20 25 30

Lys Leu Asn Lys Asn Lys Ala Leu Thr Ser Lys Lys Ala Arg Phe Asp
35 40 45

Pro Tyr Gly Lys Asn Lys Phe Ser Thr Cys Arg Ile Cys Lys Ser Ser
50 55 60

Val His Gln Pro Gly Ser His Tyr Cys Gln Gly Cys Ala Tyr Lys Lys
65 70 75 80

Gly Ile Cys Ala Met Cys Gly Lys Lys Val Leu Asp Thr Lys Asn Tyr
85 90 95

Lys Gln Thr Ser Val
100



<210> 208

<211> 456

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -22..-1

<400> 208

Met Phe Glu Glu Pro Glu Trp Ala Glu Ala Ala Pro Val Ala Ala Gly
-20 -15 -10

Leu Gly Pro Val Ile Ser Arg Pro Pro Pro Ala Ala Ser Ser Gln Asn
-5 1 5 10

Lys Gly Ser Lys Arg Arg Gln Leu Leu Ala Thr Leu Arg Ala Leu Glu
15 20 25

Ala Ala Ser Leu Ser Gln His Pro Pro Ser Leu Cys Ile Ser Asp Ser
30 35 40

Glu Glu Glu Glu Glu Glu Arg Lys Lys Lys Cys Pro Lys Lys Ala Ser
45 50 55

Phe Ala Ser Ala Ser Ala Glu Val Gly Lys Lys Gly Lys Lys Lys Cys
60 65 70

Gln Lys Gln Gly Pro Pro Cys Ser Asp Ser Glu Glu Glu Val Glu Arg
75 80 85 90

Lys Lys Lys Cys His Lys Gln Ala Leu Val Gly Ser Asp Ser Ala Glu
95 100 105

Asp Glu Lys Arg Lys Arg Lys Cys Gln Lys His Ala Pro Ile Asn Ser
110







IlX£3 


Leu 


Asp 


Asn 


vax 


Asp 
















Tin 
x^ U 






T'Vi V 
IXIX 


1 nr 


Asn 


Asp 


Pro 


Pro 




140 










145 




It X 




XT i. 


±rX (J 


nxo 


i.XXX 


Leu 


OCX 


155 










160 










Lys 


Arg 


Arg 


Cys 


Lys 


Asn 










175 












i-iXo. 


XT i. 


/iXa. 


Glu 


Ala 


Pro 








X J? VJ 










VciX 




Arg 


Thr 


Asp 


Ser 


His 


Gly 






n c: 

Z U D 










210 


Arg 




Ala 

AX a 




Arg 


Leu Asp Gly 




z Z u 










225 






Lsu 


Tyr 


Ser 


(aXy 


Pro 


Ser 


Ser 


O R 

<«:^ ^ D 










240 






Asp 


p]ro 


\j1U 


A±a. 


Pne 


Leu 


Leu 


Tyr 










O 

Z DO 








Lys 


Lys 
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Arg Arg 
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Cjiy Pro 
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Trp 
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Ser Pro 
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Ser 
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Ser 
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Arg 
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vjjxn Jirp 


Arg 


Asn Arg 
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170 


Lys 


Phe 
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Pro 


Gin 


Val 


Pro 
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Thr 


Glu 


juys inr 




Val 


Ser 


Pro 


195 








200 






Ala Arg 


Ala Gly 


Ala 


Leu Arg 
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Phe Arg 
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Ala 


Gin Arg 


Leu 


Phe 


Gin 


Glu 






245 








250 
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Arg 


Gly Pne 


Gin 


Ser 


Gin 


Val 




260 








265 




Asp 


Arg 


He Ala 


Arg 


Asp 


Leu 


Arg 


275 








280 






Ala 


Asp 


Pne Gly 


Cys 


Gly Asp 


Cys 








295 








Pro 


Val 


His Cys 


Phe 


Asp 


Leu 


Ala 






310 










Cys 


Asp 


Met Ala 


Gin 


Val 


Pro 


Leu 






325 








330 


Val 


Phe 


Cys Leu 


Ser 


Leu 


Met 


Gly 




340 








345 




Glu 


Ala 


Asn Arg 


Val 


Leu 


Lys 


Pro 


355 
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Val 


Ser 


Ser Arg 


Phe 


Glu Asp 


Val 
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Lys 


Leu 


Gly Phe 
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He 


Val 


Ser 
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Leu 


Phe Asp 
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Gly 
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Leu 
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425 





<210> 209 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17..-1

<400> 209

Met Pro Ser Ser Phe Phe Leu Leu Leu Gln Phe Phe Leu Arg Ile Asp
-15 -10 -5

Gly Val Leu Ile Arg Met Asn Asp Thr Arg Leu Tyr His Glu Ala Asp
1 5 10 15

Lys Thr Tyr Met Leu Arg Glu Tyr Thr Ser Arg Glu Ser Lys Ile Ser
20 25 30

Ser Leu Met His Val Pro Pro Ser Leu Phe Thr Glu Pro Asn Glu Ile
35 40 45

Ser Gln Tyr Leu Pro Ile Lys Glu Ala Val Cys Glu Lys Leu Ile Phe
50 55 60

Pro Glu Arg Ile Asp Pro Asn Pro Ala Asp Ser Gln Lys Ser Thr Gln
65 70 75

Val Glu
80



<210> 210 

<211> 83 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL

<222> -29..-1

<400> 210

Met Thr Leu Leu Ser Phe Ala Ala Phe Thr Ala Ala Phe Ser Val Leu
-25 -20 -15

Pro Cys Tyr Tyr Leu Gly Leu Phe Gln Arg Ala Leu Ala Ser Val Phe
-10 -5 1

Asp Pro Leu Cys Val Cys Ser Arg Val Leu Pro Thr Pro Val Cys Thr
5 10 15

Leu Val Ala Thr Gln Ala Glu Lys Ile Leu Glu Asn Gly Pro Cys Pro
25 30 35

Thr Lys Glu Ala Ala Gln Leu Val Gly Lys Gly Ser Val Ser Ala Arg
40 45 50

-h^n Ala Ser 
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<211> 229 


















<212> PRT 


















<213> Homo sapiens 
















<220> 


















<221> SIGNAL 


















<222> -23..-1


















<400> 211 


















Met Gly Asp Lys 
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Trp 
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Pro Phe 


Pro Val Leu Leu 
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Ala 


Ala 


-20 








-15 
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Leu Pro Pro Val 


Leu 


Leu 


Pro 


Gly Ala Ala Gly Phe Thr Pro Ser Leu 


-5 








1 


5 








Asp Ser Asp Phe 


Thr 


Phe 


Thr 


Leu Pro 


Ala Gly Gin Lys 


Glu 


Cys 


Phe 


10 




15 






20 




25 


Tyr Gin Pro Met 


Pro 


Leu 


Lys 


Ala Ser 


Leu Glu He Glu 


Tyr 


Gin 


Val 




30 








35 




40 




Leu Asp Gly Ala 


Gly Leu Asp 


lie Asp 


Phe His Leu Ala 


Ser 


Pro 


Glu 
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45 50 55 
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xxe 
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Phe 


Glu 
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He 
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Asp Asn 


90 










95 
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VjjXU 
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(jjlU 


Asp 
115 


Trp 
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Lys 


Tyr 


He 
12 0 


Thr 




Tiir 


Asp 
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Leu 


Asp 
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Lys 


Leu 


Glu Asp 


He 


Leu 


Glu 


Ser 


He 
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130 










135 






Ser 


Ser 


lie 


Lys 


Ser 


Arg 


Leu 


Ser 

14 b 


Lys 


Ser 


Gly 


His 


He 
150 


Gin 


He 


Leu 


Leu 


Arg 
155 


Ala 


Phe 


Glu 


Ala 


Arg 
160 


Asp 


Arg 


Asn 


He 


Gin 
165 


Glu 


Ser 


Asn 


Phe 


Asp 


Arg 


Val 


Asn 


Phe 


Trp 


Ser 


Met 


Val 


Asn 


Leu 


Val 


Val 


Met 


Val 


Val 


170 










175 










180 










185 


Val 


Ser 


Ala 


lie 


Gin 
190 


Val 


Tyr 


Met 


Leu 


Lys 
195 


Ser 
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Phe 


Glu 
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200 
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Arg 
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Arg 
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3Ji:210> 212 

<211> 152

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -21..-1

<400> 212

Met Ala Gln Leu Gly Ala Val Val Ala Val Ala Ser Ser Phe Phe Cys
-20 -15 -10

Ala Ser Leu Phe Ser Ala Val His Lys Ile Glu Glu Gly His Ile Gly
1 5 10

Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr Ser Thr Ser Gly Pro Gly
15 20 25

Phe His Leu Met Leu Pro Phe Ile Thr Ser Tyr Lys Ser Val Gln Thr
30 35 40

Thr Leu Gln Thr Asp Glu Val Lys Asn Val Pro Cys Gly Thr Ser Gly
45 50 55

Gly Val Met Ile Tyr Phe Asp Arg Ile Glu Val Val Asn Phe Leu Val
60 65 70 75

Pro Asn Ala Val His Asp Ile Val Lys Asn Tyr Thr Ala Asp Tyr Asp
80 85 90

Lys Ala Leu Ile Phe Asn Lys Ile His His Glu Leu Asn Gln Phe Cys
95 100 105

Ser Val His Thr Leu Gln Glu Val Tyr Ile Glu Leu Phe Gly Leu Glu
110 115 120

Asn Asp Phe Ser Gln Glu Ser Ser
125 130
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<210> 213 
<211> 179 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -54..-1

<400> 213

Met Ala Ala Ser Glu Ala Ala Val Val Ser Ser Pro Ser Leu Lys Thr
-50 -45 -40

Asp Thr Ser Pro Val Leu Glu Thr Ala Gly Thr Val Ala Ala Met Ala
-35 -30 -25

Ala Thr Pro Ser Ala Arg Ala Ala Ala Ala Val Val Ala Ala Ala Ala
-20 -15 -10

Arg Thr Gly Ser Glu Ala Arg Val Ser Lys Ala Ala Leu Ala Thr Lys
-5 1 5 10

Leu Leu Ser Leu Ser Gly Val Phe Ala Val His Lys Pro Lys Gly Pro
15 20 25

Thr Ser Ala Glu Leu Leu Asn Arg Leu Lys Glu Lys Leu Leu Ala Glu
30 35 40

Ala Gly Met Pro Ser Pro Glu Trp Thr Lys Arg Lys Lys Gln Thr Leu
45 50 55

Lys Ile Gly His Gly Gly Thr Leu Asp Ser Ala Ala Arg Gly Val Leu
60 65 70

Val Val Gly Ile Gly Ser Gly Thr Lys Met Leu Thr Ser Met Leu Ser
80 85 90

Gly Ser Lys Arg Tyr Thr Ala Ile Gly Glu Leu Gly Lys Ala Thr Asp
95 100 105

Thr Leu Asp Ser Thr Gly Lys Val Thr Glu Glu Lys Pro Tyr Gly Met
110 115 120

Asn Leu Ile
125



<210> 214
<211> 269
<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -92..-1

<400> 214

Met Ile Thr His Val Thr Leu Glu Asp Ala Leu Ser Asn Val Asp Leu
-90 -85 -80

Leu Glu Glu Leu Pro Leu Pro Asp Gln Gln Pro Cys Ile Glu Pro Pro
-75 -70 -65

Pro Ser Ser Ile Met Tyr Gln Ala Asn Phe Asp Thr Asn Phe Glu Asp
-60 -55 -50 -45

Arg Asn Ala Phe Val Thr Gly Ile Ala Arg Tyr Ile Glu Gln Ala Thr
-40 -35 -30

Val His Ser Ser Met Asn Glu Met Leu Glu Glu Gly His Glu Tyr Ala
-25 -20 -15
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Asp 
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70 
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Ser 


Val 
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Asp 
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Ser 
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Arg 


Ala 


Ala 
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Leu Arg Lys 
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Asp 
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Ser 
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Met 
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His 
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Arg 


He 
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Cys 


Leu 
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Glu 


Val 


He 


Pro 


Gly 


Tyr 


Glu 


Glu 
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Ala Asp 
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He Val 
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Asp 
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Glu 
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Met 
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^|c:212> PRT 




























<213> Homo sapiens 
























<220>






























<221> SIGNAL
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Ser 
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<210> 216 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -38..-1

<400> 216

Met Asn Asn Val Gln Pro Lys Ile Lys His Arg Pro Phe Cys Phe Ser
-35 -30 -25

Val Lys Gly His Val Lys Met Leu Arg Leu Val Phe Ala Leu Val Thr
-20 -15 -10

Ala Val Cys Cys Leu Ala Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu
-5 1 5 10

Phe Asn Pro Asn Gly Pro Tyr Gln Lys Lys Pro Val His Glu Lys Lys
15 20 25

Glu Val Leu



<210> 217

<211> 125

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -54..-1

<400> 217

Met Ala Asp Glu Glu Leu Glu Ala Leu Arg Arg Gln Arg Leu Ala Glu
-50 -45 -40

Leu Gln Ala Lys His Gly Asp Pro Gly Asp Ala Ala Gln Gln Glu Ala
-35 -30 -25

Lys His Arg Glu Ala Glu Met Arg Asn Ser Ile Leu Ala Gln Val Leu
-20 -15 -10

Asp Gln Ser Ala Arg Ala Arg Leu Ser Asn Leu Ala Leu Val Lys Pro
-5 1 5 10

Glu Lys Thr Lys Ala Val Glu Asn Tyr Leu Ile Gln Met Ala Arg Tyr
15 20 25

Gly Gln Leu Ser Glu Lys Val Ser Glu Gln Gly Leu Ile Glu Ile Leu
30 35 40

Lys Lys Val Ser Gln Gln Thr Glu Lys Thr Thr Thr Val Lys Phe Asn
45 50 55

Arg Arg Lys Val Met Asp Ser Asp Glu Asp Asp Asp Tyr
60 65 70



<210> 218 
<211> 376 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -21..-1



<400> 218 
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Glu 
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<211> 211 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30..-1



<400> 219 
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His 


Pro 


Arg Val 


Ala 


Phe 


Trp 


He He 


Lys 


Leu 


Pro 


111 100 










105 










110 








Arg Arg Arg Ser His Gln Asp Ala Leu Glu Gly Gly His Trp Leu Ser
120 125 130

Glu Lys Arg His Arg Leu Gln Ala Ile Arg Asp Gly Leu Arg Lys Gly
135 140 145

Thr His Lys Asp Val Leu Glu Glu Gly Thr Glu Ser Ser Ser His Ser
150 155 160

Arg Leu Ser Pro Arg Lys Thr His Leu Leu Tyr Ile Leu Arg Pro Ser
165 170 175

Arg Gln Leu
180



<210> 220 
<211> 154 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -60..-1

<400> 220

Met Gly Ser Lys Cys Cys Lys Gly Gly Pro Asp Glu Asp Ala Val Glu
-60 -55 -50 -45

Arg Gln Arg Arg Gln Lys Leu Leu Leu Ala Gln Leu His His Arg Lys
-40 -35 -30

Arg Val Lys Ala Ala Gly Gln Ile Gln Ala Trp Trp Arg Gly Val Leu
-25 -20 -15

Val Arg Arg Thr Leu Leu Val Ala Ala Leu Arg Ala Trp Met Ile Gln
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-10 -5 






1 






Cys 




Trp Arg Thr Leu Val Gin Arg Arg He Arg Gin Arg Arg Gin 






XU 


15 








20 


Ala 


Leu 


jjeu /irg vai lyr va± ixe 


^Ti". 

Gin Glu Gin 


Ala 


Thr Val 


Lys 


Leu 






25 


30 






35 




Gin 




Cys He Arg Met Trp Gin 


Cys Arg Gin 


Cys 


Tyr Arg 


Gin 


Met 






40 


45 




50 






Cys 


Asn 


Ala Leu Cys Leu Phe Gin 


Val Pro Glu 


Ser 


Ser Leu 


Ala 


Phe 






55 60 






65 






Gin 


Thr 


Asp Gly Phe Leu Gin Val 


Gin Tyr Ala 


He 


Pro Ser 


Lys 


Gin 




70 


75 




80 








Pro 


Glu 


Phe His He Glu He Leu 


Ser He 










85 




90 













<210> 221 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220>

<221> SIGNAL
<222> -42..-1

<400> 221

Met Lys Gly Gly Ala Phe Ser Asn Leu Asn Asp Ser Gln Leu Ser Ala
-40 -35 -30

Ser Phe Leu Gln Pro Ser Leu Gln Ala Asn Cys Pro Ala Leu Asp Pro
-25 -20 -15

Ala Val Ser Leu Ser Ala Pro Ala Phe Ala Ser Ala Leu Arg Ser Met
-10 -5 1 5

Lys Ser Ser Gln Ala Ala Arg Lys Asp Asp Phe Leu Arg Ser Leu Ser
10 15 20

Asp Gly Asp Ser Gly Thr Ser Glu His Ile Ser Ala Val Val Thr Ser
25 30 35

Pro Arg Ile Ser Cys His Gly Ala Ala Ile Pro Thr Ala Arg Ala Leu
40 45 50

Lys Leu Gly Cys Ser Cys Cys Thr Glu Arg Leu Leu Leu Pro Pro Pro
55 60 65 70

Ser Leu Leu Ser Leu Glu Ala Pro Ala Ser Thr
75 80



<210> 222 
<211> 346 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19..-1

<400> 222

Met Ala Met Ala Gln Lys Leu Ser His Leu Leu Pro Ser Leu Arg Gln
-15 -10 -5

Val Ile Gln Glu Pro Gln Leu Ser Leu Gln Pro Glu Pro Val Phe Thr
1 5 10

Val Asp Arg Ala Glu Val Pro Pro Leu Phe Trp Lys Pro Tyr Ile Tyr
15 20 25

Ala Gly Tyr Arg Pro Leu His Gln Thr Trp Arg Phe Tyr Phe Arg Thr
30 35 40 45

Leu Phe Gln Gln His Asn Glu Ala Val Asn Val Trp Thr His Leu Leu
50 55 60

Ala Ala Leu Val Leu Leu Leu Arg Leu Ala Leu Phe Val Glu Thr Val
65 70 75

Asp Phe Trp Gly Asp Pro His Ala Leu Pro Leu Phe Ile Ile Val Leu
80 85 90

Ala Ser Phe Thr Tyr Leu Ser Leu Ser Ala Leu Ala His Leu Leu Gln
95 100 105

Ala Lys Ser Glu Phe Trp His Tyr Ser Phe Phe Phe Leu Asp Tyr Val
110 115 120 125

Gly Val Ala Val Tyr Gln Phe Gly Ser Ala Leu Ala His Phe Tyr Tyr
130 135 140

Ala Ile Glu Pro Ala Trp His Ala Gln Val Gln Ala Val Phe Leu Pro
145 150 155

Met Ala Ala Phe Leu Ala Trp Leu Ser Cys Ile Gly Ser Cys Tyr Asn
160 165 170

Lys Tyr Ile Gln Lys Pro Gly Leu Leu Gly Arg Thr Cys Gln Glu Val
175 180 185

Pro Ser Val Leu Ala Tyr Ala Leu Asp Ile Ser Pro Val Val His Arg
190 195 200 205

Ile Phe Val Ser Ser Asp Pro Thr Thr Asp Asp Pro Ala Leu Leu Tyr
210 215 220

His Lys Cys Gln Val Val Phe Phe Leu Leu Ala Ala Ala Phe Phe Ser
225 230 235

Thr Phe Met Pro Glu Arg Trp Phe Pro Gly Ser Cys His Val Phe Gly
240 245 250

Gln Gly His Gln Leu Phe His Ile Phe Leu Val Leu Cys Thr Leu Ala
255 260 265

Gln Leu Glu Ala Val Ala Leu Asp Tyr Glu Ala Arg Arg Pro Ile Tyr
270 275 280 285

Glu Pro Leu His Thr His Trp Pro His Asn Phe Ser Gly Leu Phe Leu
290 295 300

Leu Thr Val Gly Ser Ser Ile Leu Thr Ala Phe Leu Leu Ser Gln Leu
305 310 315

Val Gln Arg Lys Leu Asp Gln Lys Thr Lys
320 325



<210> 223 
<211> 210 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -20..-1

<400> 223

Met Asp Asn Arg Phe Ala Thr Ala Phe Val Ile Ala Cys Val Leu Ser
-20 -15 -10 -5
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T .^1 1 

UK^ U. 




C o 


1 liX 


X X c 1 y X rie C 
-1 


AX a 


iyr 




Tyr 


Arg 


1 

osr ir'ro vai 


tjin 






lb 






20 






Trp 


Asp 


vjxu JrJie xie 


Ser 




J u 










21 cm 




i-iX Cl 




irXlC ■H.xy 1 y X 


Asn 










t; n 
D u 




Cys 


lie 


Thr 


lie 














b D 




± 111, 


ox Ll 


oox 




/isp va.x vax 


Thr 








o n 






m 11 


oXIl 


DVio 




(aXU iiys iriie 


vai 






y b 






100 


lie 


Asp 


Leu 


Leu 


Arg Thr Tyr 


Leu 




IIU 






115 






vai 


Ser 


Leu 


vjiy Leu Met 


Cys 










130 




i-iJLci 




X X€ 


Cys 


Arg Ser Leu 


Tyr 










145 




His 


Leu 


Leu 


Ala 


Val Thr Lys 


Glu 








160 








Lys 


His 


Thr 


Ala Thr Pro 


Ala 






175 






180 




Lys 











190 



Ala 


Ser 


He 


Gly Thr Asp 


irne 1 rp 


5 






10 




Glu 


Asn 


Ser Ser Asp Leu 


Asn Lys 








25 




Asp 


Glu 


Ala 


Asp Glu Lys 


Thr Tyr 








40 




Gly Thr 


Val 


Gly Leu Trp 


Arg Arg 






55 




60 


His 


Trp 


Tyr 


Ser Pro Pro 


Glu Arg 




70 






75 


Lys 


Cys 


Val 


Ser Phe Thr 


Leu Thr 


85 






90 




Asp 


Pro 


Gly Asn His Asn 


Ser Gly 








105 




Trp Arg 


Cys 


Gin Phe Leu 


Leu Pro 








120 




Phe Gly 


Ala 


Leu He Gly 


Leu Cys 






135 




140 


Pro 


Thr 


He 


Ala Thr Gly 


lie Leu 




150 






155 


Ser 


Met 


Leu 


Pro Ala Gly 


Ala Glu 


165 






170 




His 


Ala 


Cys Val Gin Thr 


Gly Lys 



185 



<210> 224
<211> 184
<212> PRT
<213> Homo sapiens

<220>

<221> SIGNAL
<222> -20..-1

<400> 224

Met Asp Asn Arg Phe Ala Thr Ala Phe Val Ile Ala Cys Val Leu Ser
-20 -15 -10 -5

Leu Ile Ser Thr Ile Tyr Met Ala Ala Ser Ile Gly Thr Asp Phe Trp
1 5 10

Tyr Glu Tyr Arg Ser Pro Val Gln Glu Asn Ser Ser Asp Leu Asn Lys
15 20 25

Ser Ile Trp Asp Glu Phe Ile Ser Asp Glu Ala Asp Glu Lys Thr Tyr
30 35 40

Asn Asp Ala Pro Phe Arg Tyr Asn Gly Thr Val Gly Leu Trp Arg Arg
45 50 55 60

Cys Ile Thr Ile Pro Lys Asn Met His Trp Tyr Ser Pro Pro Glu Arg
65 70 75

Thr Glu Ser Phe Asp Val Val Thr Lys Cys Val Ser Phe Thr Leu Thr
80 85 90

Glu Gln Phe Met Glu Lys Phe Val Asp Pro Gly Asn His Asn Ser Gly
95 100 105

Ile Asp Leu Leu Arg Thr Tyr Leu Trp Arg Cys Gln Phe Leu Leu Pro
110 115 120

Phe Val Ser Leu Gly Leu Met Cys Phe Gly Ala Leu Ile Gly Leu Cys
125 130 135 140

Ala Cys Ile Cys Arg Ser Leu Tyr Pro Thr Ile Ala Thr Gly Ile Leu
145 150 155

His Leu Leu Ala Asp Thr Met Leu
160



<210> 225 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -22..-1

<400> 225

Met Gly Trp Thr Met Arg Leu Val Thr Ala Ala Leu Leu Leu Gly Leu
-20 -15 -10

Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser Pro Cys Ala His
-5 1 5 10

Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gln Gly Leu Glu Val
15 20 25

Phe Tyr Pro Glu Leu Gly Asn Ile Gly Cys Lys Val Val Pro Asp Cys
30 35 40

Asn Asn Tyr Arg Gln Lys Ile Thr Ser Trp Met Glu Pro Ile Val Lys
45 50 55

Phe Pro Gly Ala Val Asp Gly Ala Thr Tyr Ile Leu Val Met Val Asp
60 65 70

Pro Asp Ala Pro Ser Arg Ala Glu Pro Arg Gln Arg Phe Trp Arg His
75 80 85 90




Leu 


Val 


Thr 


Asp 


He Lys 


Gly Ala 


Asp 


Leu 


Lys 


Lys 


Gly Lys He 


IK 








95 








100 








105 




Gly Gin 


Glu 


Leu 


Ser Ala 


Tyr 


Gin 


Ala 


Pro 


Ser 


Pro 


Pro Ala His 








110 








115 










120 


Ser Gly Phe His Arg Tyr Gln Phe Phe Val Tyr Leu Gln Glu Gly Lys
125 130 135

Val Ile Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys
140 145 150

Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser
155 160 165 170

Thr Gln Phe Met Thr Gln Asn Tyr Gln Asp Ser Pro Thr Leu Gln Ala
175 180 185

Pro Arg Glu Arg Ala Ser Glu Pro Lys His Lys Asn Gln Ala Glu Ile
190 195 200

Ala Ala Cys
205



<210> 226 
<211> 74 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -41..-1

<400> 226

Met Ile Ala Arg Arg Asn Pro Val Pro Leu Arg Phe Leu Pro Asp Glu
-40 -35 -30

Ala Arg Ser Leu Pro Pro Pro Lys Leu Thr Asp Pro Arg Leu Leu Tyr
-25 -20 -15 -10

Ile Gly Phe Leu Gly Tyr Cys Ser Gly Leu Ile Asp Asn Leu Ile Arg
-5 1 5

Arg Arg Pro Ile Ala Thr Ala Gly Leu His Arg Gln Leu Leu Tyr Ile
10 15 20

Thr Ala Phe Phe Leu Leu Asp Ile Ile Leu
25 30



<210> 227 
<211> 73 
<212> PRT 
<213> Homo sapiens

<400> 227

Met Glu Lys Tyr Glu Asn Leu Gly Leu Val Gly Glu Gly Ser Tyr Gly
1               5                   10                  15

Met Val Met Lys Cys Arg Asn Lys Asp Thr Gly Arg Ile Val Ala Ile
            20                  25                  30

Lys Lys Phe Leu Glu Ser Asp Asp Asp Lys Met Val Lys Lys Ile Ala
        35                  40                  45

Met Arg Glu Val Lys Leu Leu Lys Gln Leu Arg His Glu Asn Leu Val
    50                  55                  60

Asn Leu Leu Glu Val Cys Lys Lys Lys
65                  70



<210> 228

<211> 82

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -16..-1



<400> 228 
Met Lys Arg Leu Leu Pro Ala Thr Ser Leu Ala Gly Pro Val Leu Ser
    -15                 -10                 -5

Thr Leu Ile Ala Pro Thr Pro Met Leu Phe Cys Glu Asp Lys Ser Trp
1               5                   10                  15

Asp Leu Phe Leu Phe Phe Lys Ser His Lys Thr Trp Gly Ile Ser Thr
            20                  25                  30

Asn Leu Ser Ser Cys Pro Phe Gly Asn Leu Phe Leu Cys Val Gln Phe
        35                  40                  45

Val Arg Glu Lys Gln Ser Phe Cys Met Asn Thr Glu Cys Asp Leu Arg
    50                  55                  60

Lys Asn
65



<210> 229 
<211> 119 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -56..-1



<400> 229 
Met Ala Glu Pro Ser Ala Ala Thr Gln Ser His Ser Ile Ser Ser Ser
    -55                 -50                 -45

Ser Phe Gly Ala Glu Pro Ser Ala Pro Gly Gly Gly Gly Ser Pro Gly
-40                 -35                 -30                 -25

Ala Cys Pro Ala Leu Gly Thr Lys Ser Cys Ser Ser Ser Cys Ala Asp
                -20                 -15                 -10

Ser Phe Val Ser Ser Ser Ser Ser Gln Pro Val Ser Leu Phe Ser Thr
            -5                  1               5

Ser Gln Glu Gly Leu Ser Ser Leu Cys Ser Asp Glu Pro Ser Ser Glu
    10                  15                  20

Ile Met Thr Ser Ser Phe Leu Ser Ser Ser Glu Ile His Asn Thr Gly
25                  30                  35                  40

Leu Thr Ile Leu His Gly Glu Lys Ser His Val Leu Gly Ser Gln Pro
                45                  50                  55

Ile Leu Ala Lys Lys Lys Lys
            60



<210> 230

<211> 54

<212> PRT

<213> Homo sapiens

<400> 230 

Ala Phe Val Trp Glu Pro Ala Met Val Arg Ile Asn Ala Leu Thr Ala
1               5                   10                  15

Ala Ser Glu Ala Ala Cys Leu Ile Val Ser Val Asp Glu Thr Ile Lys
            20                  25                  30

Asn Pro Arg Ser Thr Val Asp Ala Pro Thr Ala Ala Gly Arg Gly Arg
        35                  40                  45

Gly Arg Gly Arg Pro His
    50



<210> 231 

<211> 210 

<212> PRT 

<213> Homo sapiens 



<220> 
<221> SIGNAL
<222> -14..-1



<400> 231 



Met 


Leu 


Thr 


Leu 


Leu Gly Leu 
-10 


Ser 


Phe 


He 
-5 


Leu 


Ala 


Gly 


Leu 


He 
1 


Val 


Gly 


Gly 


Ala 
5 


Cys 


He Tyr Lys 


Tyr 
10 


Phe 


Met 


Pro 


Lys 


Ser 
15 


Thr 


He 


Tyr 


Arg 


Gly 
20 


Glu 


Met 


Cys Phe Phe 
25 


Asp 


Ser 


Glu 


Asp 


Pro 
30 


Ala 


Asn 


Ser 


Leu 


Arg 


Gly 


Gly Glu 


Pro Asn Phe 


Leu 


Pro 


Val 


Thr 


Glu 


Glu 


Ala 


Asp 


He 


35 








40 








45 










50 


Arg 


Glu 


Asp 


Asp 


Asn He Ala 
55 


He 


He 


Asp 
60 


Val 


Pro 


Val 


Pro 


Ser 
65 


Phe 


Ser 


Asp 


Ser 


Asp 


Pro Ala Ala 


He 


He 


His 


Asp 


Phe 


Glu 


Lys 


Gly Met 








70 






75 










80 






Thr 


Ala 


Tyr 
85 


Leu 


Asp Leu Leu 


Leu 
90 


Gly 


He 


Cys 


Tyr 


Leu 
95 


Met 


Pro 


Leu 


Asn 


Thr 
100 


Ser 


He 


Val Met Pro 
105 


Pro 


Lys 


Asn 


Leu 


Val 
110 


Glu 


Leu 


Phe 


Gly 


Lys 


Leu 


Ala 


Ser 


Gly Arg Tyr 


Leu 


Pro 


Gin 


Thr 


Tyr 


Val 


Val 


Arg 


Glu 


115








120 








125 










130 




Leu 


Val 


Ala 


Val Glu Glu 


He 


Arg 


Asp 


Val 


Ser 


Asn 


Leu 


Gly 


He 










135 






140 










145 




Phe


He 


Tyr 


Gin 


Leu Cys Asn 


Asn 


Arg 


Lys 


Ser 


Phe 


Arg 


Leu 


Arg Arg 








150 






155 










160 








Asp 


Leu 
165 


Leu 


Leu Gly Phe 


Asn 
170 


Lys 


Arg 


Ala 


He 


Asp 
175 


Lys 


Cys 


Trp 


Lys


He 
180 


Arg 


His 


Phe Pro Asn 
185 


Glu 


Phe 


He 


Val 


Glu 
190 


Thr 


Lys 


He 


Cys 



Gin Glu 



195



<210> 232

<211> 108

<212> PRT

<213> Homo sapiens 

<400> 232 



Met 


Gly 


Cys 


Val 


Phe 


Gin 


Ser 


Thr 


Glu 


Asp 


Lys 


Cys 


He 


Phe 


Lys 


He 


1 








5 










10 










15 




Asp 


Trp 


Thr 


Leu 
20 


Ser 


Pro 


Gly 


Glu 


His 
25 


Ala 


Lys 


Asp 


Glu 


Tyr 
30 


Val 


Leu 


Tyr 


Tyr 


Tyr 


Ser 


Asn 


Leu 


Ser 


Val 


Pro 


He 


Gly 


Arg 


Phe 


Gin 


Asn Arg 






35 










40 










45 








Val 


His 


Leu 


Met 


Gly Asp 


He 


Leu 


Cys 


Asn 


Asp 


Gly 


Ser 


Leu 


Leu 


Leu 




50 










55 










60 










Gin 


Asp 


Val 


Gin 


Glu 


Ala 


Asp 


Gin 


Gly 


Thr 


Tyr 


He 


Cys 


Glu 


He 


Arg 


65 










70 










75 










80 


Leu 


Lys 


Gly 


Glu 


Ser 
85 


Gin 


Val 


Phe 


Lys 


Lys 
90 


Ala 


Val 


Val 


Leu 


His 
95 


Val 


Leu 


Pro 


Glu 


Glu 


Pro 


Lys 


Gly 


Thr 


Gin 


Met 


Leu 


Thr 











100 105 
<210> 233
<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18..-1



<400> 233 

Met Ser Ser Gly Arg Leu Arg Trp Leu Met Pro Val Ile Pro Ala Leu
            -15                 -10                 -5

Trp Gly Ala Glu Lys Gly Glu Ser Pro Glu Val Ser Ser Phe Glu Thr
        1               5                   10

Arg Leu Ala Asn Met Ala Lys Pro Cys Leu Tyr
15                  20                  25



<210> 234

<211> 36

<212> PRT

<213> Homo sapiens

<400> 234

Met Ser Ala Arg Ile Pro Phe Tyr Lys Asp Thr Ser Gln Ile Arg Leu
1               5                   10                  15

Gly Ser Thr Ile Ile Pro His Phe Asn Leu Ile Thr Phe Val Lys Thr
            20                  25                  30

Phe Phe Gln Ile
        35



<210> 235

<211> 307

<212> PRT

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -13..-1



<400> 235 
Met Leu Ala Val Ser Leu Thr Val Pro Leu Leu Gly Ala Met Met Leu
            -10                 -5                  1

Leu Glu Ser Pro Ile Asp Pro Gln Pro Leu Ser Phe Lys Glu Pro Pro
    5                   10                  15

Leu Leu Leu Gly Val Leu His Pro Asn Thr Lys Leu Arg Gln Ala Glu
20                  25                  30                  35

Arg Leu Phe Glu Asn Gln Leu Val Gly Pro Glu Ser Ile Ala His Ile
                40                  45                  50

Gly Asp Val Met Phe Thr Gly Thr Ala Asp Gly Arg Val Val Lys Leu
            55                  60                  65

Glu Asn Gly Glu Ile Glu Thr Ile Ala Arg Phe Gly Ser Gly Pro Cys
        70                  75                  80



Lys 


Thr


Arg 


Asp 


Asp 


VjjJ_U 


Pro 


Val 




85 










90 






oxy 


Pro 


Asn 


vjjxy 


Thr 


Leu 


Fne 


100 










105 






irlie 


oXU. 


vax 


Asn 


Pro 


Trp 


Lys 


Arg 










120 










Thr 


Pro 


He 


Glu 


Gly 


Lys 


Asn 








135 










Val 


Ser 


Gin 


Asp 


Gly 


Arg 


Lys 


He 






150 










155 


Trp 


Gin 


Arg 


Arg 


Asp 


Tyr 


Leu 


Leu 




165 










170 




vjiy 


Arg" 


Leu 


Leu 


Glu 


Tyr 


Asp 


Thr 


180 










185 






Leu 


Asp 


Gin 


Leu 


Arg 


Phe 


Pro 


Asn 










200 








Asp 


Phe 


Val 


Leu 


Val 


Ala 


Glu 


Thr 








215 










Tyr 


Val 


Ser 


Gly 


Leu 


Met 


Lys 


Gly 






230 










235 


Met 


Pro 


Gly 


Phe 


Pro 


Asp 


Asn 


He 




245 










250 






Val 


Gly 


Met 


Ser 


Thr 


He 


Arg 


i^60 










265 








Phe 


Leu 


Ser 


Glu 


Arg 


Pro 


Trp 










280 










Lys 


Lys 












<210> 236
<211> 106
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -32..-1

<400> 236












Met 


Phe 


Ala 


Pro 


Ala 


Val 


Met 


Arg 






-30 










-25 


Gly Tyr Gly 


Val 


Pro 


Met 


Leu 


Leu 




-15 










-10 




Leu Arg 


Glu 


Phe 


Ser 


Gin 


He 


Arg 


1 








5 








Asp 


Pro 


Glu 


Leu 


Glu 


Lys 


Lys 


Leu 








20 










Ser 


Glu 


Tyr 


Glu 


Lys 


He 


Lys 


Asp 






35 










40 


He 


Arg Gly 


Pro 


Arg 


Pro 


Trp 


Glu 




50 










55 




Asn 


Pro 


Glu 


Ser 


Leu 


Lys 


Thr 


Lys 


65 










70 










Cys 


Gly Arg Pro 


Leu 


Gly 


He Arg 






95 








vax 


Ala Asp Ala 


Cys 


Lys 


Gly Leu 






110 






115 


laXU 


Val 


Lys Leu 


Leu 


T /-V1 1 

Leu 


Ser Ser 




125 








130 


Met 


Ser 


Phe Val 


Asn 


Asp 


Leu Thr 


140 








145 




Tyr 


Phe 


Thr Asp 


Ser 


Ser 


Ser Lys 








160 






Leu 


Val 


Met Glu 


Gly 


Thr 


Asp Asp 






175 








Val 


Thr 


Arg Glu 


Val 


Lys 


Val Leu 






190 






195 


Gly 


Val 


Gin Leu 


Ser 


Pro 


Ala Glu 




205 








210 


Thr 


Met 


Ala Arg 


He 


Arg 


Arg Val 


220 








225 




Gly 


Ala 


Asp Leu 


Phe 


Val 


Glu Asn 








240 






Arg 


Pro 


Ser Ser 


Ser 


Gly 


Gly Tyr 






255 








Pro 


Asn 


Pro Gly 


Phe 


Ser 


Met Leu 






270 






275 


He 


Lys 


Arg Met 


He 


Phe 


Lys Ala 




285 








290 


Ala 


Phe Arg Lys 


Asn 


Lys 


Thr Leu 








-20 






Leu 


He 


Val Gly 
-5 


Gly 


Ser 


Phe Gly 


Tyr 


Asp 


Ala Val 


Lys 


Ser 


Lys Met 




10 








15 


Lys 


Glu 


Asn Lys 


He 


Ser 


Leu Glu 


25 








30 




Ser 


Lys 


Phe Asp 


Asp 


Trp 


Lys Asn 








45 






Asp 


Pro 


Asp Leu 


Leu 


Gin 


Gly Arg 






60 








Thr 


Thr 












<210> 237 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19..-1

<400> 237 

Met Asp Leu Arg Gln Phe Leu Met Cys Leu Ser Leu Cys Thr Ala Phe
                -15                 -10                 -5

Ala Leu Ser Lys Pro Thr Glu Lys Lys Asp Arg Val His His Glu Pro
            1               5                   10

Gln Leu Ser Asp Lys Val His Asn Asp Ile
    15                  20



<210> 238 

<211> 117 

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -20..-1

<400> 238



Met


Asp 


Asn 


Arg 


Phe 


Ala 


Thr 


Ala 


Phe 


Val 


He 


Ala Cys 


Val 


Leu Ser 


-20 










-15 










-10 






-5 


Leu


He 


Ser 


Thr 


He 


Tyr 


Met 


Ala 


Ala 


Ser 


He 


Gly Thr 


Asp 


Phe Trp 










1 








5 








10 






Glu 


Tyr 


Arg 


Ser 


Pro 


Val 


Gin 


Glu 


Asn 


Ser 


Ser Asp 


Leu 


Asn Lys 






15 










20 








25 






Ser


He 


Trp 


Asp 


Glu 


Phe 


He 


Ser 


Asp 


Glu 


Ala 


Asp Glu 


Lys 


Thr Tyr 




30 










35 










40 






Asn


Asp 


Ala 


Leu 


Phe 


Arg 


Tyr Asn Gly Thr Val 


Gly Leu 


Trp 


Gly Arg 


45 










50 










55 






60 


Cys 


He 


Thr 


He 


Pro 


Lys 


Asn 


Met 


His 


Trp 


Tyr 


Ser Pro 


Pro 


Glu Arg 










65 










70 








75 


Thr 


Gly 


He 


Ser 


Leu 


He 


Leu 


Thr 


Ser 


Val 


Phe 


Phe Thr 


Trp 


Leu He 








80 










85 








90 




He 


Asp 


Lys 


Thr 


Thr 





















95 



<210> 239 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -37..-1



<400> 239 
Met Glu Arg Gln Ser Arg Val Met Ser Glu Lys Asp Glu Tyr Gln Phe
        -35                 -30                 -25

Gln His Xaa Xaa Ala Xaa Xaa Leu Leu Val Phe Asn Phe Leu Leu Ile
    -20                 -15                 -10

Leu Thr Ile Leu Thr Ile Trp Leu Phe Lys Asn His Arg Phe Arg Phe
-5                  1               5                   10

Leu His Glu Thr Gly Gly Ala Met Val Tyr Gly Leu Ile Met Gly Leu
            15                  20                  25

Ile Ser Arg Tyr Ala Thr Ala Pro Thr Asp Ile Glu Ser Gly Thr Val
        30                  35                  40

Cys Asp Cys Val Lys Leu Thr Phe Ser Pro Pro Thr Leu Leu Val Asn
    45                  50                  55

Val Thr Asp Gln Val Tyr Glu Tyr Lys Tyr Lys Arg Glu Ile Ser Gln
60                  65                  70                  75

His Asn Ile Asn Pro His Gln Gly Asn Ala Ile Leu Glu Lys Met Thr
                80                  85                  90

Phe Asp Pro Glu Ile Phe Phe Asn Val Leu Leu Pro Pro Ile Ile Phe
            95                  100                 105

His Ala Gly Tyr Ser Leu Lys Lys Arg His Phe Phe Gln Asn Leu Gly
        110                 115                 120

Ser Ile Leu Thr Tyr Ala Phe Leu Gly Thr Ala Ile Ser Cys Ile Val
    125                 130                 135

Ile Gly
140



<210> 240

<211> 126

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -27..-1

<400> 240

Met Gln Phe Val Asn Val Gly Tyr Phe Leu Ile Ala Ala Gly Val Val
        -25                 -20                 -15

Val Leu Ala Leu Gly Phe Leu Gly Cys Tyr Gly Ala Lys Thr Glu Ser
    -10                 -5                  1               5

Met Cys Ala Leu Val Thr Phe Phe Phe Ile Leu Leu Leu Ile Phe Ile
                10                  15                  20

Ala Glu Val Ala Ala Ala Val Val Ala Leu Val Tyr Thr Thr Met Ala
            25                  30                  35

Glu His Phe Leu Thr Leu Leu Val Val Pro Ala Ile Lys Lys Asp Tyr
        40                  45                  50

Gly Ser Gln Glu Asp Phe Thr Gln Val Trp Asn Thr Thr Met Lys Gly
    55                  60                  65

Leu Lys Cys Arg Gly Phe Thr Asn Tyr Thr Asp Phe Glu Asp Ser Pro
70                  75                  80                  85

Tyr Phe Lys Met His Lys Pro Val Thr Met Lys Lys Lys Lys
                90                  95






<210> 241 
<211> 174 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -115..-1



<400> 241 



Met 


Arg 


Trp 


Ser 


Cys 


Glu 


His 


Leu 


Val 


Met Val 


Trp 


He 


Asn 


Ala 


Phe 


-115 








-110 






-105 








-101 


Val 


Met 


Leu 


Thr 


Thr 


Gin 


Leu 


Leu 


Pro 


Ser Lys 


Tyr 


Cys 


Asp 


Leu 


Leu 










-95 










-90 








-85 




His 


Lys 


Ser 


Ala 


Ala 


His 


Leu 


Gly Lys 


Trp Gin 


Lys 


Leu 


Glu 


His 


Gly 








-80 










-75 








-70 






Ser 


Tyr 


Ser 


Asn 


Ala 


Pro 


Gin 


His 


He 


Trp Ser 


Glu 


Asn 


Thr 


He 


Trp 






-65 










-60 








-55 








Pro 


Gin 


Gly 


Val 


Leu 


Val 


Arg 


His 


Ser 


Arg Cys 


Leu 


Tyr 


Arg 


Ala 


Met 




-50 










-45 








-40 










Gly 


Pro 


Tyr 


Asn 


Val 


Ala 


Val 


Pro 


Ser 


Asp Val 


Ser 


His 


Ala 


Arg 


Phe 


-35










-30 








-25 










-20 


Tyr


Phe 


Leu 


Phe 


His 


Arg 


Pro 


Leu 


Arg 


Leu Leu 


Asn 


Leu 


Leu 


He 


Leu 








-15 










-10 








-5 






Glu 


Gly 


Gly Val Val 


Phe 


Tyr 


Gin 


Leu Tyr 


Ser 


Leu 


Leu 


Arg 


Ser 


Glu






1 








5 








10 








Lys 


Trp 


Asn 


His 


Thr 


Leu 


Ser 


Met 


Ala Leu 


He 


Leu 


Phe 


Cys 


Asn 




15 










20 








25 












Tyr 


Val 


Leu 


Phe 


Lys 


Leu 


Leu Arg Asp Arg 


He 


Val 


Leu 


Gly 


Arg 












35 








40 










45 


Ala 


Tyr 


Ser 


Tyr 


Pro 


Leu 


Asn 


Ser 


Tyr 


Glu Leu 


Lys 


Ala 


Asn 







50 55



<210> 242
<211> 896
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 18..173

<221> sig_peptide
<222> 18..77
<223> Von Heijne matrix
score 6.5
seq GLCVLQLTTAVTS/AF

<221> polyA_signal
<222> 864..869

<221> polyA_site
<222> 882..893



<400> 242 






aaccttcaca gtgtgag atg cct agt gtg aac agt gct gga tta tgt gtc 50
                  Met Pro Ser Val Asn Ser Ala Gly Leu Cys Val
                  -20                 -15                 -10

ttg cag ttg aca acg gca gtr acc agt gcc ttt tta cta gca aaa gtg 98
Leu Gln Leu Thr Thr Ala Val Thr Ser Ala Phe Leu Leu Ala Lys Val
                -5                  1               5

aat cct ttc gaa rct ttt ctc tca agg ggc ttt tgg cta tgt gct gcc 146
Asn Pro Phe Glu Xaa Phe Leu Ser Arg Gly Phe Trp Leu Cys Ala Ala
        10                  15                  20

cat cat ttc att cat cct tgc ctg gat tgagacgtgt tcctgattca 193
His His Phe Ile His Pro Cys Leu Asp
    25                  30

aagtgttacc tcaagaagca gaagaagaaa acagactcct gatagttcag gatgcttcag 253

agagggcagc acttatacct ggtggtcttt ctgatggtca gttttattcc cctcctgaat 313

ccgaagcagg atctgaagaa gctgaagaaa aacaggacag tgagaaacca cttttagaac 373

tatgagtact acttttgtta aatgtgaaaa accctcacag aaagtcatcg aggcaaaaag 433

aggcaggcag tggagtctcc ctgtcgacag taaagttgaa atggtgacgt ccactgctgg 493

ctttattgaa cagctaataa agatttattt attgtaatac ctcacagacg ttgtaccata 553

tccatgcaca tttagttgcc tgcctgtggc tggtaaggta atgtcatgat tcatcctctc 613

ttcagtgaga ctgagcctga tgtgttaaca aataggtgaa gaaagtcttg tgctgtattc 673

ctaatcaaaa gacttaatat attgaagtaa cactttttta gtaagcaaga taccttttta 733

|;;1:tteaattca cagaatggaa tttttttgtt tcatgtctca gatttatttt gtatttcttt 793 

tttaacactc tacatttccc ttgtttttta actcatgcac atgtgctctt tgtacagttt 853

caaaaagtgt aataaaatct gacatgtcaa araaaaaaaa mcy 896



<210> 243
<211> 851
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 17..595

<221> sig_peptide
<222> 17..85
<223> Von Heijne matrix
score 3.7
seq FLPPLXRAFACRG/CQ

<221> polyA_signal
<222> 820..825

<221> polyA_site
<222> 840..851

<400> 243 

aagggggcgt ggggcc atg gtg gtc ttg cgg gcg ggg aag aag acc ttt ctc 52
                 Met Val Val Leu Arg Ala Gly Lys Lys Thr Phe Leu
                             -20                 -15

ccc cct ctm wgc cgc gcc ttc gcc tgc cgc ggc tgt caa ctc gct ccg 100
Pro Pro Leu Xaa Arg Ala Phe Ala Cys Arg Gly Cys Gln Leu Ala Pro
    -10                 -5                  1               5

gag cgc ggc gcc gag cgc agg gat aca gcg ccc agc ggg gtc tca aga 148
Glu Arg Gly Ala Glu Arg Arg Asp Thr Ala Pro Ser Gly Val Ser Arg
                10                  15                  20

ttc tgc cct cca aga aag tct tgc cat gat tgg ata gga ccc cca gat 196
Phe Cys Pro Pro Arg Lys Ser Cys His Asp Trp Ile Gly Pro Pro Asp
            25                  30                  35

aaa tat tca aac ctt cga cct gtt cac ttt tac ata cct gaa aat gaa 244
Lys Tyr Ser Asn Leu Arg Pro Val His Phe Tyr Ile Pro Glu Asn Glu
        40                  45                  50

tct cca ttg gaa caa aag ctt aga aaa tta aga caa gaa aca caa gaa 292
Ser Pro Leu Glu Gln Lys Leu Arg Lys Leu Arg Gln Glu Thr Gln Glu
    55                  60                  65

tgg aat caa cag ttc tgg gca aac cag aat ttg act ttt agt aag gaa 340
Trp Asn Gln Gln Phe Trp Ala Asn Gln Asn Leu Thr Phe Ser Lys Glu
70                  75                  80                  85

aaa gaa gaa ttt att cac tca aga cta aaa act aaa ggc ctg ggc ctg 388
Lys Glu Glu Phe Ile His Ser Arg Leu Lys Thr Lys Gly Leu Gly Leu
                90                  95                  100

aga act gaa tca ggt cag aaa gca aca ttg aat gca gaa gaa atg gcg 436
Arg Thr Glu Ser Gly Gln Lys Ala Thr Leu Asn Ala Glu Glu Met Ala
            105                 110                 115

gac ttc tac aag gaa ttt tta agt aaa aat ttt cag aag cac atg tat 484
Asp Phe Tyr Lys Glu Phe Leu Ser Lys Asn Phe Gln Lys His Met Tyr
        120                 125                 130

tat aac aga gat tgg tac aag cgc aat ttt gcc atc acc ttc ttc atg 532
Tyr Asn Arg Asp Trp Tyr Lys Arg Asn Phe Ala Ile Thr Phe Phe Met
    135                 140                 145

gga aaa gtg gcc ctg gaa agg att tgg aac aag ctt aaa cag aaa caa 580
Gly Lys Val Ala Leu Glu Arg Ile Trp Asn Lys Leu Lys Gln Lys Gln
150                 155                 160                 165

aag aag agg agc aac taggagtcca ctctgaccca gccagagtcc aggtttccac 635
Lys Lys Arg Ser Asn
                170

aggaagcara tggagctcct ttcacagggg ctctgagaaa aactggagct gatctcaaga 695
^^^geeccacat ettcctaagg ggccecatgg eetgtttggg ggcagggtag gtcetggggc 755 
^ctgtgggee gcctgcetgc tgatgtgggc tetaggeeag cttgttgtca egtacgtggt 815 
gtgaaataaa gcccaagcac tgggaaaaaa aaaaaa 851



<210> 244 
<211> 495 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 89..334

<221> sig_peptide
<222> 89..130
<223> Von Heijne matrix
score 3.6
seq AFTLXSLLQAALL/CV

<221> polyA_signal
<222> 462..467

<221> polyA_site
<222> 484..495



<400> 244 

agtaggaasg cgccgsccgt ggaggcgcca cgtcccttgc sgcggcggga gagamatcgc 60 
ttggacttcg gggcggcctc ggacggcc atg gcc ttt acc ctg tas tca ctg 112

Met Ala Phe Thr Leu Xaa Ser Leu 
-10 



ctg 


cag 


gca 


gcc 


ctg 


etc 


tgc 


gtc 


aac 


gcc 


ate 


gca 


gtg 


ctg 


cac 


gag 


160 


Leu 


Gin 


Ala 


Ala 


Leu 


Leu 


Cys 


Val 


Asn 


Ala 


He 


Ala 


Val 


Leu 


His 


Glu 






-5 










1 








5 










10 




gag 


cga 


ttc 


etc 


aag 


aac 


att 


ggc 


tgg 


gga 


aca 


gae 


cag 


gga 


att 


ggt 


208 


Glu 


Arg 


Phe 


Leu 


Lys 


Asn 


He 


Gly 


Trp 


Gly 


Thr 


Asp 


Gin 


Gly He 


Gly 












15 










20 










25 






gga 


ttt 


gga 


gaa 


gag 


ccg 


gga 


att 


aaa 


tea 


sag 


sta 


atg 


avs 


ctt 


att 


256 


Gly 


Phe 


Gly 


Glu 


Glu 


Pro 


Gly 


He 


Lys 


Ser 


Xaa 


Xaa 


Met 


Xaa 


Leu 


He 










30 










35 










40 








cga 


tct 


gta 


aga 


acc 


gtg 


atg 


aga 


gtg 


cca 


ttg 


ata 


ata 


gta 


aac 


tea 


304 


Arg 


Ser 


Val 


Arg 


Thr 


Val 


Met 


Arg 


Val 


Pro 


Leu 


He 


He 


Val 


Asn 


Ser 








45 










50 










55 










att 


gca 


att 


gtg 


tta 


ctt 


tta 


tta 


ttt 


gga 


tgaatwteat tggagaaaat 


354 


He 


Ala 


He 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Gly 

















Cll ^5 

ggakactcag aaraggacat gccaktaraa kttattactt tggtcattat tggaatattt 414

atatcttagc tggctgacct tgcacttgtc aaaaatgtaa agctgaaaat aaaaccaggg 474

rittctattta aaaaaaaaaa a 495 



<210> 245
<211> 884
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..614

<221> sig_peptide
<222> 21..83
<223> Von Heijne matrix
score 10
seq LWALAMVTRPASA/AP

<221> polyA_signal
<222> 849..854

<221> polyA_site
<222> 873..884



<400> 245 

aataccttag accctcagtc atg cca gtg cct gct ctg tgc ctg ctc tgg gcc 53
                      Met Pro Val Pro Ala Leu Cys Leu Leu Trp Ala
                          -20                 -15

ctg gca atg gtg acc cgg cct gcc tca gcg gcc ccc atg ggc ggc cca 101
Leu Ala Met Val Thr Arg Pro Ala Ser Ala Ala Pro Met Gly Gly Pro
-10                 -5                  1               5

gaa ctg gca cag cat gag gag ctg acc ctg ctc ttc cat ggg acc ctg 149
Glu Leu Ala Gln His Glu Glu Leu Thr Leu Leu Phe His Gly Thr Leu
            10                  15                  20

cag ctg ggc cag gcc ctc aac ggt gtg tac agg acc acg gag gga cgg 197
Gln Leu Gly Gln Ala Leu Asn Gly Val Tyr Arg Thr Thr Glu Gly Arg
        25                  30                  35

ctg aca aag gcc agg aac agc ctg ggt ctc tat ggc cgc aca ata gaa 245
Leu Thr Lys Ala Arg Asn Ser Leu Gly Leu Tyr Gly Arg Thr Ile Glu
    40                  45                  50

ctc ctg ggg cag gag gtc agc cgg ggc cgg gat gca gcc cag gaa ctt 293
Leu Leu Gly Gln Glu Val Ser Arg Gly Arg Asp Ala Ala Gln Glu Leu
55                  60                  65                  70

cgg gca agc ctg ttg gaa act car atg gag gag gat att ctg cas ctg 341
Arg Ala Ser Leu Leu Glu Thr Gln Met Glu Glu Asp Ile Leu Xaa Leu
                75                  80                  85

cag gca rag gcc aca gct gag gtg ctg ggg gag gtg gcc cag gca car 389
Gln Ala Xaa Ala Thr Ala Glu Val Leu Gly Glu Val Ala Gln Ala Gln
            90                  95                  100

aag gtg cta cgg gac agc gtg cag cgg cta daa ktc cag ctg arg asc 437
Lys Val Leu Arg Asp Ser Val Gln Arg Leu Xaa Xaa Gln Leu Xaa Xaa
        105                 110                 115

gcc tgg ctg ggc cct gcc tac cga aaa ttt gar gtc tta aag gcy ccc 485
Ala Trp Leu Gly Pro Ala Tyr Arg Lys Phe Glu Val Leu Lys Ala Pro
    120                 125                 130

cck gam aar car aac cac atc cta tgg gcc ctc aca ggc cac gtg cak 533
Pro Xaa Lys Gln Asn His Ile Leu Trp Ala Leu Thr Gly His Val Xaa
135                 140                 145                 150

Hp99 car arg egg gar atg gtg gca cag cag cwt ekg ctg cna car ate 581 
Arg Gln Xaa Arg Glu Met Val Ala Gln Gln Xaa Xaa Leu Xaa Gln Ile
                155                 160                 165

cag gar aaa ctc cac aca gcg gcg ctc cca gcc tgaatctgcc tggatggaac 634
Gln Glu Lys Leu His Thr Ala Ala Leu Pro Ala
            170                 175

tgaggaccaa tcatgctgca aggaacactt ccacgccccg tgaggcccct gtgcagggag 694
LiPagctgectg ttcaetggga teagceaggg egeegggeec cacttctgag cacagagear 754 
pgacagacge aggeggggac aaaggeagag gatgtageec cattggggag gggtggagga 814 
,j^^ggacatgta ecctttcatr metacacace cetcattaaa gcavagtegt ggeateteaa 874 
'-'Haaaaaaaaa 884 



<210> 246 
<211> 897 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 94..573

<221> sig_peptide
<222> 94..258
<223> Von Heijne matrix
score 4.7
seq IGILCSLLGTVLL/WV

<221> polyA_signal
<222> 862..867

<221> polyA_site
<222> 886..897

<400> 246 

SLagggcggct gcctagcacc cggaagagcc gtcaacttag cgagcgcaac aggctgccgc 60 
tgaggagctg gagctggtgg ggactgggcc gca atg gac aag ctg aag aag gtg 114 

Met Asp Lys Leu Lys Lys Val 
-55 -50 



ctg 


age 


ggg 


cag 


gac 


acg 


gag 


gac 


egg 


age 


ggc 


ctg 


tec 


gag 


gtt 


gtt 


162 


Leu 


Ser 


Gly 


Gin 
-45 


Asp 


Thr 


Glu 


Asp 


Arg 
-40 


Ser 


Gly 


Leu 


Ser 


Glu 
-35 


Val 


Val 




gag 


gca 


tct 


tea 


tta 


age 


tgg 


agt 


ace 


agg 


ata 


aaa 


ggc 


ttc 


att 


gcg 


210 


Glu 


Ala 


Ser 
-30 


Ser 


Leu 


Ser 


Trp 


Ser 
-25 


Thr 


Arg 


He 


Lys 


Gly 
-20 


Phe 


He 


Ala 




tgt 


ttt 


get 


ata 


gga 


att 


etc 


tgc 


tea 


ctg 


ctg 


ggt 


act 


gtt 


ctg 


ctg 


258 


Cys 


Phe 
-15 


Ala 


He 


Gly 


He 


Leu 
-10 


Cys 


Ser 


Leu 


Leu 


Gly 
-5 


Thr 


Val 


Leu 


Leu 




tgg 


gtg 


ccc 


agg 


aag 


gga 


eta 


eae 


etc 


ttc 


gca 


gtg 


ttt 


tat 


ace 


ttt 


306 


Trp 


Val 


Pro 


Arg 


Lys 


Gly 


Leu 


His 


Leu 


Phe 


Ala 


Val 


Phe 


Tyr 


Thr 


Phe 




1 








5 










10 










15 






r^St 


aat 


ate 


gca 


tea 


att 


ggg 


agt 


ace 


ate 


ttc 


etc 


atg 


gga 


cca 


gtg 


354 


:S?31y 


Asn 


He 


Ala 
20 


Ser 


He 


Gly 


Ser 


Thr 
25 


He 


Phe 


Leu 


Met 


Gly 
30 


Pro 


Val 




r^^aa 


cag 


ctg 


aag 


ega 


atg 


ttt 


gag 


cct 


act 


cgt 


ttg 


att 


gca 


act 


ate 


402 




Gin 


Leu 
35 


Lys 


Arg 


Met 


Phe 


Glu 
40 


Pro 


Thr 


Arg 


Leu 


He 
45 


Ala 


Thr 


He 




=|iHtg 


gtg 


ctg 


ttg 


tgt 


ttt 


gca 


ett 


ace 


ctg 


tgt 


tct 


gee 


ttt 


tgg 


tgg 


450 


Uket 


Val 
50 


Leu 


Leu 


Cys 


Phe 


Ala 
55 


Leu 


Thr 


Leu 


Cys 


Ser 
60 


Ala 


Phe 


Trp 


Trp 




cat 


aac 


aag 


gga 


ett 


gca 


ett 


ate 


ttc 


tgc 


att 


ttg 


cag 


tct 


ttg 


gca 


498 




Asn 


Lys 


Gly 


Leu 


Ala 


Leu 


He 


Phe 


Cys 


He 


Leu 


Gin 


Ser 


Leu 


Ala 




H^5 










70 










75 










80 




i>tg 


acg 


tgg 


tac 


age 


ett 


tec 


ttc 


ata 


eca 


ttt 


gca 


agg 


gat 


get 


gtg 


546 


[. fj^^ 


Thr 


Trp 


Tyr 


Ser 


Leu 


Ser 


Phe 


He 


Pro 


Phe 


Ala 


Arg 


Asp 


Ala 


Val 












85 










90 










95 






^Jj|kaa 


aad 


tgt 


ttt 


gee 


gtg 


tgt 


ett 


gca 


taatteatgg ecagttttat 




593 


i^tys 


Xaa 


Cys 


Phe 
100 


Ala 


Val 


Cys 


Leu 


Ala 
105 



















gaagctttgg aaggeactat ggaeagaage tggtggaeag ttttgtwact atettcgaaa 653 

cctetgtctt acagacatgt gecttttate ttgcageaat gtgttgcttg tgattcgaae 713 

atttgagggt taettttgga agcaacaata cattctcgaa ectgaatgte agtagcaeag 773 

gatgagaagt gggttctgta tcttgtggag tggaatcttc etcatgtacc tgtttcctct 833 

ctggatgttg teeeactgaa ttcccatgaa tacaaaccta tteagcaaca gcaaaaaaaa 893 

aaaa 897



<210> 247 
<211> 518 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 74..397

<221> sig_peptide
<222> 74..127
<223> Von Heijne matrix
score 7.7
seq LLLLPVLGLLVSS/KT

<221> polyA_signal
<222> 472..477

<221> polyA_site
<222> 507..518

<400> 247 

aaagaaagag ctgcsgtgca ggaattcgtg tgccggattt ggttagctga gcccaccgag 60 
aggcgcctgc agg atg aaa get etc tgt etc etc etc etc cct gtc etg 109 
Met Lys Ala Leu Cys Leu Leu Leu Leu Pro Val Leu 
-15 -10 



ggg 


ctg 


ttg 


gtg 


tet 


age 


aag 


ace 


ctg 


tgc 


tee 


atg 


gaa 


gaa 


gcc 


ate 


157 


Gly 


Leu 
-5 


Leu 


Val 


Ser 


Ser 


Lys 
1 


Thr 


Leu 


Cys 


Ser 
5 


Met 


Glu 


Glu 


Ala 


He 
10 




aat 


gag 


agg 


ate 


eag 


gag 


gtc 


gcc 


ggc 


tec 


eta 


ata 


ttt 


agg 


gca 


ata 


205 


r='|^sn 


Glu 


Arg 


He 


Gin 
15 


Glu 


Val 


Ala 


Gly 


Ser 
20 


Leu 


He 


Phe 


Arg 


Ala 
25 


He 




i;f%gc 


age 


att 


ggc 


cga 


ggg 


age 


gag 


age 


gtc 


ace 


tee 


agg 


ggg 


gae 


ctg 


253 




Ser 


lie 


Gly 


Arg 


Gly 


Ser 


Glu 


Ser 


Val 


Thr 


Ser 


Arg 


Gly 


Asp 


Leu 










30 










35 










40 








^l&ct 


act 


tge 


ccc 


cga 


ggc 


ttc 


gee 


gtc 


ace 


ggc 


tge 


act 


tgt 


ggc 


tec 


301 


;mia 


Thr 


Cys 


Pro 


Arg 


Gly 


Phe 


Ala 


Val 


Thr 


Gly Cys 


Thr 


Cys 


Gly 


Ser 








45 










50 










55 










vjgcc 


tgt 


ggc 


teg 


tgg 


gat 


gtg 


ege 


gcc 


gag 


acc 


aea 


tgt 


eac 


tgc 


eag 


349 


.1 Ala 


Cys 
60 


Gly 


Ser 


Trp 


Asp 


Val 
65 


Arg 


Ala 


Glu 


Thr 


Thr 
70 


Cys 


His 


Cys 


Gin 




11 tgc 


gcg 


ggc 


atg 


gae 


tgg 


ace 


gga 


gcg 


cgc 


tge 


tgt 


egt 


gtg 


eag 


ccc 


397 


i,^;Cys 


Ala 


Gly 


Met 


Asp 


Trp 


Thr 


Gly 


Ala 


Arg 


Cys 


Cys 


Arg 


Val 


Gin 


Pro 




?5 










80 










85 










90 





.,;|^gaggtegeg cgeagcgcgt geacagegeg ggcggaggcg gctccaggte cggaggggtt 457 
::J;^cgggggagc tggaaataaa ectggagatg atgatgatga tgatgatgga aaaaaaaaaa 517 



518 



<210> 248 
<211> 350 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 51..242

<221> sig_peptide
<222> 51..116
<223> Von Heijne matrix
score 6.5
seq SCLCPALFPGTSS/FI

<221> polyA_signal
<222> 319..324

<221> polyA_site
<222> 339..350



<400> 248 

acgtcattcc aaaaccacac ccttgcaaag ctttgtactc cgcaccccag atg atc 56
                                                       Met Ile

tcc agg cag ctc aga tct ctt tcc tgc ctt tgc cct gca ctg ttc ccc 104
Ser Arg Gln Leu Arg Ser Leu Ser Cys Leu Cys Pro Ala Leu Phe Pro
-20                 -15                 -10                 -5

ggt act tcc tcc ttt att gta gca ctc agc tcc cca gcc gat ctg tac 152
Gly Thr Ser Ser Phe Ile Val Ala Leu Ser Ser Pro Ala Asp Leu Tyr
                1               5                   10

atc cct cav agg cas cga tct gat gaa ttg gtt ttt gaa tcc car aaa 200
Ile Pro Xaa Arg Xaa Arg Ser Asp Glu Leu Val Phe Glu Ser Gln Lys
        15                  20                  25

ggg tct gcc atg gag ttg gca gtc atc acg gta rat ggc gta 242
Gly Ser Ala Met Glu Leu Ala Val Ile Thr Val Xaa Gly Val
    30                  35                  40

tgattttgct gaattttaaa taaaatgaaa accataaatt acatratgct tttattgach 302
=^pttgacmact ggcctaaata aaaaractet gaetccaaaa aaaaaaaa 350 



<210> 249
<211> 996
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 111..191

<221> sig_peptide
<222> 111..155
<223> Von Heijne matrix
score 5.8
seq FLXLMTLTTHVHS/SA

<221> polyA_signal
<222> 965..970

<221> polyA_site
<222> 986..996



<400> 249 

atccgataca gaacatgcag taatgtggac tgcccaccag aagcaggtga tttccgagct 60
cagcaatgct cagctcataa tgatgtcaag caccatggcc agttttatga atg ggy 116
                                                       Met Gly
                                                       -15

ttc ctg wgt cta atg acc ctg aca acc cat gtt cac tca agt gcc aag 164
Phe Leu Xaa Leu Met Thr Leu Thr Thr His Val His Ser Ser Ala Lys
            -10                 -5                  1

cca aat gaa caa ccc tgg ttg ttg aac tagcacctaa ggtcttarat 211
Pro Asn Glu Gln Pro Trp Leu Leu Asn
    5                   10



ggtacgcgtt gctatacaga atctttggat atgtgcatca gtggtttatg ccaaattgtt 271 

ggctgcgatc accagctggg aagcaccgtc aaggaarata actgtggggt ctgcaacrga 331 

natgggtcca cctgccggct ggtccgaggg cartataaat cccakctctc cgcaaccaaa 391 

tcrgatgata ctgtggttgc aattccctat ggaagtakac atattcgcct tgtcttaaaa 451 

ggtcctgatc acttatatct ggaarccawa accctccagg ggactaawgg tgaaaacagt 511 

ctcasctcca caggaacttt ccttgtggac aattctagtg tggacttcca gaawtttcca 571 

gacwdagaga tactgagaat ggctggacca ctcacagcag atttcattgt caawattcgt 631 

aactcgggct ccgctgacag tacagtccag kkcatcttct atcaacccat catccaccga 691 

tggagggara cggatttctt tccttgctca gcaacctgtg gaggaggtta tcagctgaca 751 

tcggctgagt gctacgatct gaggagcaac cgtgtggttg ctgaccaata ctgtcactat 811 

tacccagaga acatcaaacc caaacccaag cttcaggagt gcaacttgga tccttgtcca 871 

gccaggtcag tcaaatttgc tagttcattt gtcataaaca taactcaagt tccaaatagg 931 

ttatttaaat taaaatgaaa cgttttaatt aaaaataaaa tgaaattaaa catcaaaaaa 991 

aaaaa 996 



<210> 250 
<211> 860 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 45..602

<221> sig_peptide
<222> 45..107
<223> Von Heijne matrix
score 8.5
seq LLTIVGLILPTRG/QT

<221> polyA_signal
<222> 828..833

<221> polyA_site
<222> 850..860

<400> 250 

acctctctcc acgaggctgc cggcttagga cccccagctc cgac atg tcg ccc tct 56
Met Ser Pro Ser
-20

ggt cgc ctg tgt ctt ctc acc atc gtt ggc ctg att ctc ccc acc aga 104
Gly Arg Leu Cys Leu Leu Thr Ile Val Gly Leu Ile Leu Pro Thr Arg
-15 -10 -5

gga cag acg ttg aaa gat acc acg tcc agt tct tca gca gac tca act 152
Gly Gln Thr Leu Lys Asp Thr Thr Ser Ser Ser Ser Ala Asp Ser Thr
1 5 10 15

atc atg gac att cag gtc ccg aca cga gcc cca gat gca gtc tac aca 200
Ile Met Asp Ile Gln Val Pro Thr Arg Ala Pro Asp Ala Val Tyr Thr
20 25 30

gaa ctc cag ccc acc tct cca acc cca acc tgg cct gct gat gaa aca 248
Glu Leu Gln Pro Thr Ser Pro Thr Pro Thr Trp Pro Ala Asp Glu Thr
35 40 45

cca caa ccc cag acc cag acc cag caa ctg gaa gga acg gat ggg cct 296
Pro Gln Pro Gln Thr Gln Thr Gln Gln Leu Glu Gly Thr Asp Gly Pro
50 55 60






cta gtg aca gat cca gag aca cac wak agc mcc aaa gca gct cat ccc 344
Leu Val Thr Asp Pro Glu Thr His Xaa Ser Xaa Lys Ala Ala His Pro
65 70 75

act gat gac acc acg acg ctc tct gag aga cca tcc cca agc aca kac 392
Thr Asp Asp Thr Thr Thr Leu Ser Glu Arg Pro Ser Pro Ser Thr Xaa
80 85 90 95

gtc cat dac aga ccb cba kda ccc tca akc cat ctg gtt ttc atg agg 440
Val His Xaa Arg Pro Xaa Xaa Pro Ser Xaa His Leu Val Phe Met Arg
100 105 110

atg acc cct tct tct atg atg aac aca ccc tcc gga aac sgg ggc tgt 488
Met Thr Pro Ser Ser Met Met Asn Thr Pro Ser Gly Asn Xaa Gly Cys
115 120 125

tgg tcg cag ctg tgc tgt tca tca cag gca tca tca tcc tca cca gtg 536
Trp Ser Gln Leu Cys Cys Ser Ser Gln Ala Ser Ser Ser Ser Pro Val
130 135 140

gca agt gca ggc agc tgt ccc ggt tat gcc gga atc att gca ggt gag 584
Ala Ser Ala Gly Ser Cys Pro Gly Tyr Ala Gly Ile Ile Ala Gly Glu
145 150 155

tcc atc aga aac agg agc tgacaacctg ctgggcaccc gaagaccaag 632
Ser Ile Arg Asn Arg Ser
160 165
;;peeeetgeea gctcaccgtg eecagcetec tgcateeeet cgaagageet ggceagagag 692 
V^gaagaeaea gatgatgaag ctggagceag ggctgeeggt eegagtctcc tacetceccc 752 
■^'kaccctgeee geecctgaag gctacetggc gccttggggg etgtccctca agttatetcc 812 
"ictgctaaga caaaaagtaa agcactgtgg tetttgcaaa aaaaaaaa 860 



<210> 251
<211> 593
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 24..560

<221> sig_peptide
<222> 24..101
<223> Von Heijne matrix
score 10.3999996185303
seq LLLLLCGPSQDQC/RP

<221> polyA_signal
<222> 563..568

<221> polyA_site
<222> 583..593

<400> 251 

aanccagctg csgccggcca gcc atg gag act gga gcg ctg cgg cgc ccg caa 53
Met Glu Thr Gly Ala Leu Arg Arg Pro Gln
-25 -20

ctt ctc ccg ttg ctg ctg ctg ctc tgc ggc cct tcc cag gat caa tgc 101
Leu Leu Pro Leu Leu Leu Leu Leu Cys Gly Pro Ser Gln Asp Gln Cys
-15 -10 -5

cga cct gta ctc cag aat ctg ttg cag agc cca ggc ttg aca tgg agc 149
Arg Pro Val Leu Gln Asn Leu Leu Gln Ser Pro Gly Leu Thr Trp Ser
1 5 10 15

... gaa gtg ccc act ggg aga gaa gga aag gaa ggt ggg gat cgg gga 197
Leu Glu Val Pro Thr Gly Arg Glu Gly Lys Glu Gly Gly Asp Arg Gly
20 25 30

cca ... cta akt ggg gcc act cca gcc agg agc cct cag ggc aag gag 245
Pro Gly Leu Xaa Gly Ala Thr Pro Ala Arg Ser Pro Gln Gly Lys Glu
35 40 45

atg ggg aga caa agg acc aga aag gtg aag ggc cct gct tgg akt cac 293
Met Gly Arg Gln Arg Thr Arg Lys Val Lys Gly Pro Ala Trp Xaa His
50 55 60

aca gca aat cag gaa cta aac agg atg agg tct ctg tct tct ggc tcc 341
Thr Ala Asn Gln Glu Leu Asn Arg Met Arg Ser Leu Ser Ser Gly Ser
65 70 75 80

gtg cca gtg ggg cat ctg gag ggt ggc acg gtc aag ctt cag aag gac 389
Val Pro Val Gly His Leu Glu Gly Gly Thr Val Lys Leu Gln Lys Asp
85 90 95

acg ggc ctc cat tcc tgc ara gat ggt atg gct tct ctt gaa ggg acg 437
Thr Gly Leu His Ser Cys Xaa Asp Gly Met Ala Ser Leu Glu Gly Thr
100 105 110

cca gct tca gtc ctg gct gat gct tgc cca gga ttc cat gat gtg aan 485
Pro Ala Ser Val Leu Ala Asp Ala Cys Pro Gly Phe His Asp Val Xaa
115 120 125

... car arg gcc cta ttt ggg tta agt ggg ana rta ctg tgg ctg aaa 533
Val Gln Xaa Ala Leu Phe Gly Leu Ser Gly Xaa Xaa Leu Trp Leu Lys
130 135 140

acc cac ttc tgc ctt tct att ana ctt taaataaact ctgaaracct 580
Thr His Phe Cys Leu Ser Ile Xaa Leu
145 150

gtaaaaaaaa aaa 593



<210> 252

<211> 1114

<212> DNA

<213> Homo sapiens

<220> 

<221> CDS 

<222> 109. .558 



<221> sig_peptide 

<222> 109. .273 

<223> Von Heijne matrix 

score 3.70000004768372 
seq VAFMLTLPILVCK/VQ 



<221> polyA_site
<222> 1104..1114



<400> 252 

attagctstc caaggtctcc cccagcactg aggagctcgc ctgctgccct cttgcgcgcg 60
ggaagcagca ccaagttcac ggccaacgcc ttggcactag ggtccaga atg gct aca 117
Met Ala Thr
-55

aca gtc cct gat ggt tgc cgc aat ggc ctg aaa tcc aag tac tac aga 165
Thr Val Pro Asp Gly Cys Arg Asn Gly Leu Lys Ser Lys Tyr Tyr Arg
-50 -45 -40

ctt tgt gat aag gct gaa gct tgg ggc atc gtc cta gaa acg gtg gcc 213
Leu Cys Asp Lys Ala Glu Ala Trp Gly Ile Val Leu Glu Thr Val Ala
-35 -30 -25

aca gcc ggg gtt gtg acc tcg gtg gcc ttc atg ctg act ctc ccg atc 261
Thr Ala Gly Val Val Thr Ser Val Ala Phe Met Leu Thr Leu Pro Ile
-20 -15 -10 -5

ctc gtc tgc aag gtg cag gac tcc aac agg cga aaa atg ctg cct act 309
Leu Val Cys Lys Val Gln Asp Ser Asn Arg Arg Lys Met Leu Pro Thr
1 5 10

cag ttt ctc ttc ctc ctg ggt gtg ttg ggc atc ttt ggc ctc acc ttc 357
Gln Phe Leu Phe Leu Leu Gly Val Leu Gly Ile Phe Gly Leu Thr Phe
15 20 25

gcc ttc atc atc gga ctg gac ggg agc aca ggg ccc aca cgc ttc ttc 405
Ala Phe Ile Ile Gly Leu Asp Gly Ser Thr Gly Pro Thr Arg Phe Phe
30 35 40

ctc ttt ggg atc ctc ttt tcc atc tgc ttc tcc tgc ctg ctg gct cat 453
Leu Phe Gly Ile Leu Phe Ser Ile Cys Phe Ser Cys Leu Leu Ala His
45 50 55 60

gct gtc agt ctg acc aag ctc gtc cgg ggg agg aaa gcc cct ttc cct 501
Ala Val Ser Leu Thr Lys Leu Val Arg Gly Arg Lys Ala Pro Phe Pro
65 70 75

gtt ggt gat tct ggg tct ggc cgt ggg ctt cag cct agt cca gga tgt 549
Val Gly Asp Ser Gly Ser Gly Arg Gly Leu Gln Pro Ser Pro Gly Cys
80 85 90

tat cgc tat tgaatatatt gtcctgacca tgaataggac caacgtcaat 598
Tyr Arg Tyr
95

gtcttttctg agctttccgc tcctcgtcgc aatgaaaact ttgtcctcct gctcacctac 658
u ktcctcttct tgatggcgct gacctteetc wtgtccteet tcaccttetg tggtkccttc 718 
^..pegggctgga avagaeatgg ggcccaeatc taccteasga tgctcskete cattgceate 778 
fif 59"gtggcct ggatcaccct gctcatgctt cctgactttg accgcrggtg ggatgacacc 838 
^^'^temtcarct eegecttggs tresaatggc tgggtgttcc tgttggctta tgttagtccc 898 
gagttttggc tgctcacaaa gcaackaaac cccatggatt atcctgttga ggatgctttc 958
tgtaaacctc aactcgtgaa gaagagctat ggtgtggrga acagagccta skctcaagag 1018
Jji^aaatcactc aaggttttga agagaeaggg gaeaegctct atgeecccta tteeacacat 1078 
tttcagctgc agaascagcc tccccaaaaa aaaaaa 1114



<210> 253 
<211> 1182 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128. .835 

<221> sig_peptide
<222> 128..220
<223> Von Heijne matrix
score 4.69999980926514
seq LAVDSWWLDPGHA/AV

<221> polyA_signal
<222> 1145..1150

<221> polyA_site
<222> 1170..1181

<400> 253

aagaactgcg tctcgcgacc caggcgcggg ttcccggagg acagccaaca agcgatgctg 60 
ccgccgccgt ttcctgattg gttgtgggtg gctacctctt cgttctgatt ggccgctagt 120 



gagcaag atg ctg agc aag ggt ctg aag cgg aaa cgg gag gag gag gag 169
Met Leu Ser Lys Gly Leu Lys Arg Lys Arg Glu Glu Glu Glu
-30 -25 -20

gag aag gaa cct ctg gca gtc gac tcc tgg tgg cta gat cct ggc cac 217
Glu Lys Glu Pro Leu Ala Val Asp Ser Trp Trp Leu Asp Pro Gly His
-15 -10 -5

gca gcg gtg gca cag gca ccc ccg gcc gtg gcc tct agc tcc ctc ttt 265
Ala Ala Val Ala Gln Ala Pro Pro Ala Val Ala Ser Ser Ser Leu Phe
1 5 10 15

gac ctc tca gtg ctc aag ctc cac cac agc ctg cag vrr agt rag ccg 313
Asp Leu Ser Val Leu Lys Leu His His Ser Leu Gln Xaa Ser Xaa Pro
20 25 30

gac ctg cgg cac ctg gtg ctg gtc atr aac act ctg cgg cgc atc cag 361
Asp Leu Arg His Leu Val Leu Val Xaa Asn Thr Leu Arg Arg Ile Gln
35 40 45

... tcc atg gca ccc gcg gct gcc ctg cca cct gtg cct acc cca cct 409
... Ser Met Ala Pro Ala Ala Ala Leu Pro Pro Val Pro Thr Pro Pro
50 55 60

... gcc ccc ant gtg gct gac aac tta ctg gca agc tcg gac gct gcc 457
Ala Ala Pro Xaa Val Ala Asp Asn Leu Leu Ala Ser Ser Asp Ala Ala
65 70 75

ctt tca gcc tcc atg gcc arm ctc ctg gar gac ctc agc cac att gag 505
Leu Ser Ala Ser Met Ala Xaa Leu Leu Glu Asp Leu Ser His Ile Glu
80 85 90 95

... ctg agt cag gct ccc caa ccc ttg gca gac gag ggg cca cca ggc 553
Gly Leu Ser Gln Ala Pro Gln Pro Leu Ala Asp Glu Gly Pro Pro Gly
100 105 110

... agc atc ggg gga wca ccg ccc amc ctg ggt gcc ttg gac ctg ctg 601
... Ser Ile Gly Gly Xaa Pro Pro Xaa Leu Gly Ala Leu Asp Leu Leu
115 120 125

ggc cca gcc act ggc tgt cta ctg gac aat ggg ctt gag ggc ctg ttt 649
Gly Pro Ala Thr Gly Cys Leu Leu Asp Asn Gly Leu Glu Gly Leu Phe
130 135 140

gag gat att gac acc tct atg tat gac aat gaa ctt tgg gca cca gcc 697
Glu Asp Ile Asp Thr Ser Met Tyr Asp Asn Glu Leu Trp Ala Pro Ala
145 150 155

tct gag ggc ctc aaa cca ggc cct gag gat ggg ccg ggc aag gag gaa 745
Ser Glu Gly Leu Lys Pro Gly Pro Glu Asp Gly Pro Gly Lys Glu Glu
160 165 170 175

gct ccg gag ctg gac gag gcc gaa ttg gac tac ctc atg gat gtg ctg 793
Ala Pro Glu Leu Asp Glu Ala Glu Leu Asp Tyr Leu Met Asp Val Leu
180 185 190

gtg ggc aca cag gca ctg gag cga ccg ccg ggg cca ggg cgc 835
Val Gly Thr Gln Ala Leu Glu Arg Pro Pro Gly Pro Gly Arg
195 200 205

tgagccctcg tgctggaatg gttgtctggt atctgaactg agcctgctgg ctggaccaac 895
tgtcctcgaa aagacacagc tggcttccct agtacagaga acagggcttg ggccactttg 955
gagagacaga atctagtcct gggcaacttc acatccgtcc tcctgtctca gggctggcag 1015
ggggagcctg gaattacccc ctagtgatgg aatgacaggg tctggtgggg actgaattcc 1075






ctggccctgg ggtcatagct tgggctgttc cttctctgat acgggaagag acccaatcag 1135 
atttttcaaa ttaaagccag tcctgggaaa tctcaaaaaa aaaaaac 1182 



<210> 254 

<211> 1073 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .505 

<221> sig_peptide 

<222> 59. .358 

<223> Von Heijne matrix 

score 3.70000004768372 

seq LASSFLFTMGGLG/FI 

<221> polyA_signal
<222> 1042..1047

<221> polyA_site
<222> 1062..1073

<400> 254

actgtttnng ggaggcgcgt ggggcttgag gccgagaacg gcccttgctg ccaccaac 58

atg gag act ttg tac cgt gtc ccg ttc tta gtg ctc gaa tgt ccc aac 106
Met Glu Thr Leu Tyr Arg Val Pro Phe Leu Val Leu Glu Cys Pro Asn
-100 -95 -90 -85

j.,|Ptg aag ctg aag aag ccg ccc tgg ttg cac atg ccg teg gcc atg act 154 

Leu Lys Leu Lys Lys Pro Pro Trp Leu His Met Pro Ser Ala Met Thr
-80 -75 -70

gtg tat gct ctg gtg gtg gtg tct tac ttc ctc atc acc gga gga ata 202
Val Tyr Ala Leu Val Val Val Ser Tyr Phe Leu Ile Thr Gly Gly Ile
-65 -60 -55

att tat gat gtt att gtt gaa cct cca agt gtc ggt tct atg act gat 250
Ile Tyr Asp Val Ile Val Glu Pro Pro Ser Val Gly Ser Met Thr Asp
-50 -45 -40

gaa cat ggg cat cag agg cca gta gct ttc ttg gcc tac aga gta aat 298
Glu His Gly His Gln Arg Pro Val Ala Phe Leu Ala Tyr Arg Val Asn
-35 -30 -25

gga caa tat att atg gaa gga ctt gca tcc agc ttc cta ttt aca atg 346
Gly Gln Tyr Ile Met Glu Gly Leu Ala Ser Ser Phe Leu Phe Thr Met
-20 -15 -10 -5

gga ggt tta ggt ttc ata atc ctg gac gga tcg aat gca cca aat atc 394
Gly Gly Leu Gly Phe Ile Ile Leu Asp Gly Ser Asn Ala Pro Asn Ile
1 5 10

cca aaa ctc aat aga ttc ctt ctt ctg ttc att gga ttc gtc tgt gtc 442
Pro Lys Leu Asn Arg Phe Leu Leu Leu Phe Ile Gly Phe Val Cys Val
15 20 25

cta twr agt ttt tkc ayg gct aga gta ttc atg aga atg aaa ctg ccg 490
Leu Xaa Ser Phe Xaa Xaa Ala Arg Val Phe Met Arg Met Lys Leu Pro
30 35 40

ggc tat ctg atg ggt tagagtgcct ttgasaagaa atcagtggat actggatttg 545
Gly Tyr Leu Met Gly
45

ctcctgtcaa wgaastttta aaggctgtmc caatcctcta atatgaaatg tggaaaagaa 605 

tgaagagcag cagtaaaaga aatatctagt gaaaaaacag gaagcgtatt gaagcttgga 665 

ctagaatttc ttcttggtat taaagagaca agtttatcac agaatttttt ttcctgctgg 725 

gctattgcta taccaatgat gttgagtggc attttctttt tagtttttca ttaaaatata 785

ttccatatct acaactataa tatcaaataa agtgattatt ttttacaacc ctcttaacat 845 

tttttggaga tgacatttct gattttcaga aattaacata aaatccagaa gcaagattcc 905 

gtaagctgag aactctggac agttgatcag ctttacctat ggtgctttgc ctttaactag 965 

agtgtgtgat ggtagattat ttcagatatg tatgtaaaac tgtttcctga acaataagat 1025 

gtatgaacgg agcagaaata aatacttttt ctaattaaaa aaaaaaaa 1073 



<210> 255 

<211> 818 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1. .207 

<221> sig_peptide
<222> 1..147
<223> Von Heijne matrix
score 7.59999990463257
seq HLPFLLLLSCVGX/XP

<221> polyA_signal
<222> 784..789

<221> polyA_site
<222> 807..818

<400> 255

atg cct ttc cat ttt ccg ttc ctt ggg ttt gtg tgt ctg cat ctc cat 48
Met Pro Phe His Phe Pro Phe Leu Gly Phe Val Cys Leu His Leu His
-45 -40 -35

ctt acc cct tgc ctg act gta ccc cgt aga ccc ctg ttt ctc ctc ctg 96
Leu Thr Pro Cys Leu Thr Val Pro Arg Arg Pro Leu Phe Leu Leu Leu
-30 -25 -20

cac ctg tgt ccc cat ctg ccc ttc ttg ttg ctc ctg tca tgt gtc ggg 144
His Leu Cys Pro His Leu Pro Phe Leu Leu Leu Leu Ser Cys Val Gly
-15 -10 -5

gkc www ccc tcc tgt ctg cct tct tcc tcc act tgt gtc agc ttg cat 192
Xaa Xaa Pro Ser Cys Leu Pro Ser Ser Ser Thr Cys Val Ser Leu His
1 5 10 15

ttt ttt att cct gac tgagtcacca cacccctctc ccctgatcaa agggaatatk 247
Phe Phe Ile Pro Asp
20

artttttaat ttggatcgac tgaggtgcca ggagaaactg cagkcccagg tatccmvaca 307
gccaccagga tggtccctcg ccccaccccc accgcctctk ccccaccttt tccaacgtgt 367
tgcatgctgg gaactggggg gtgtggggga aggggctgcc ggcttctttc aggangctga 427
rgtttggarg caaaatcaac ctgggaracc accccggccg cggcgcctca gtggacaggt 487
gggargaaaa gaaaacttct taccttggar garggacatc ccgcttcctt atccttagct 547
tttttgttgc tcctccccac tgcccctttt aatttatttg gttgtttgcg gaaggagggg 607
ggaagggggt aagctgggcc gggaactgtc cgaggtgctg agctggggcg ggaccggaat 667
cctcccggta gggtaccagg gactgagttg ggcctggggc cgtgtccaag gtgccaatga 727
tgcgggccga cagarcgggc cgcactgtct gtctgtccgt ctgtcccgga aagaactata 787 
aagcgctgga agcgcctgca aaaaaaaaaa a 818 



<210> 256 

<211> 971 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 12..734

<221> sig_peptide 

<222> 12. .101 

<223> Von Heijne matrix 

score 4.80000019073486 

seq ILFCVGAVGACTL/SV 

<221> polyA_signal
<222> 914..919

<221> polyA_site
<222> 961..971

<400> 256

l,i,katacacaga a atg ggg act gcg age aga age aac ate get ege cat ctg 50 
Met Gly Thr Ala Ser Arg Ser Asn Ile Ala Arg His Leu
-30 -25 -20

caa acc aat ctc att cta ttt tgt gtc ggt gct gtg ggc gcc tgt act 98
Gln Thr Asn Leu Ile Leu Phe Cys Val Gly Ala Val Gly Ala Cys Thr

ctc tct gtc aca caa ccg tgg tac cta gaa gtg gac tac act cat gag 146
Leu Ser Val Thr Gln Pro Trp Tyr Leu Glu Val Asp Tyr Thr His Glu
1 5 10 15

gcc gtc acc ata aag tgt acc ttc tcc gca acc gga tgc cct tct gag 194
Ala Val Thr Ile Lys Cys Thr Phe Ser Ala Thr Gly Cys Pro Ser Glu
20 25 30

caa cca aca tgc ctg tgg ttt cgc tac ggt gct cac cag cct gag aac 242
Gln Pro Thr Cys Leu Trp Phe Arg Tyr Gly Ala His Gln Pro Glu Asn
35 40 45

ctg tgc ttg gac ggg tgc aaa agt gag gca gas aag ttc aca gtg agg 290
Leu Cys Leu Asp Gly Cys Lys Ser Glu Ala Xaa Lys Phe Thr Val Arg
50 55 60

gag gcc ctc aaa gaa aac caa gtt tcc ctc act gta aac aga gtg act 338
Glu Ala Leu Lys Glu Asn Gln Val Ser Leu Thr Val Asn Arg Val Thr
65 70 75

tca aat gac agt gca att tac atc tgt gga ata gca ttc ccc agt gtg 386
Ser Asn Asp Ser Ala Ile Tyr Ile Cys Gly Ile Ala Phe Pro Ser Val
80 85 90 95

ccg gaa gcg aga gct aaa cag aca gga gga ggg acc aca ctg gtg gta 434
Pro Glu Ala Arg Ala Lys Gln Thr Gly Gly Gly Thr Thr Leu Val Val
100 105 110

aga gaa att aag ctg ctc agc aag gaa ctg cgg agc ttc ctg aca gct 482
Arg Glu Ile Lys Leu Leu Ser Lys Glu Leu Arg Ser Phe Leu Thr Ala
115 120 125

ctt gta tca ctg ctc tct gtc tat gtg acc ggt gtg tgc gtg gcc ttc 530
Leu Val Ser Leu Leu Ser Val Tyr Val Thr Gly Val Cys Val Ala Phe
130 135 140

ata ctc ctc tcc aaa tca aaa tcc aac cct cta aga aac aaa gaa ata 578
Ile Leu Leu Ser Lys Ser Lys Ser Asn Pro Leu Arg Asn Lys Glu Ile
145 150 155

aaa gaa gac tca caa aag aag aag agt gct cgg cgt att ttt cag gaa 626
Lys Glu Asp Ser Gln Lys Lys Lys Ser Ala Arg Arg Ile Phe Gln Glu
160 165 170 175

att gct caa gaa cta tac cat aag aga cat gtg gaa aca aat cag caa 674
Ile Ala Gln Glu Leu Tyr His Lys Arg His Val Glu Thr Asn Gln Gln
180 185 190

tct gag aaa gat aac aac act tat gaa aac aga aga gta ctt tcc aac 722
Ser Glu Lys Asp Asn Asn Thr Tyr Glu Asn Arg Arg Val Leu Ser Asn
195 200 205

tat gaa agg cca tagaaacgtt ttaattttca atgaagtcac tgaaaatcca 774
Tyr Glu Arg Pro
210

actccaggag ctatggcagt gttaatgaac atatatcatc aggtcttaaa aaaaaataaa 834
ggtaaactga aaagacaact ggctacaaag aaggatgcca raatgtaagg aaactataac 894
taataktcat taccaaaata ctaaaaccca acaaaatgca actgaaaaat accttccaaa 954
cttgccaaaa aaaaaaw 971



<210> 257
<211> 640
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 378..518

<221> sig_peptide
<222> 378..467
<223> Von Heijne matrix
score 5.5
seq SLMTCTTLINASA/IS

<221> polyA_signal
<222> 607..612

<221> polyA_site
<222> 628..640



<400> 257

agcctgggta akgcccaaga tggctgtctt cgccttagta ctcgtgtgaa gttggcgggg 60
acggttcctg tcatcttctt gggcttattt ggtgtgctgt tgaagggggg agactagaga 120
aatggcaggg aacctcttat ccggggcagg taggcgcctg tgggactggg tgcctctggc 180
gtgcagaagc ttctctcttg gtgtgcctag attgatcggt ataaggctca ctctcccgcc 240
ccccaaagtg gttgatcgtt ggaacgagaa aagggccatg ttcggagtgt atgacaacat 300
cgggatcctg ggaaactttg aaaagcaccc caaagaactg atcagggggc ccatatggct 360
tcgaggttgg aaaggga atg aat tgc aac gtt gta tcc gaa aga gga aaa 410
Met Asn Cys Asn Val Val Ser Glu Arg Gly Lys
-30 -25 -20

tgg ttg gaa gta gaa tgt tcg ctg atg acc tgc aca acc tta ata aac 458
Trp Leu Glu Val Glu Cys Ser Leu Met Thr Cys Thr Thr Leu Ile Asn
-15 -10 -5

gca tcc gct atc tct aca aac act tta acc gac atg gga agt ttc gat 506
Ala Ser Ala Ile Ser Thr Asn Thr Leu Thr Asp Met Gly Ser Phe Asp
1 5 10

aga aga gaa agc tgagaacttc ggaaaaggct catctgtcac cctggaraag 558
Arg Arg Glu Ser
15

ggaaactgta cttttccctg tgaggaaacg gctttgtatt ttctctgtaa taaaatgggg 618
cttctttgga aaaaaaaaaa aa 640 



<210> 258 

<211> 745 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110..304

<221> sig_peptide
<222> 110..193
<223> Von Heijne matrix
score 4.59999990463257
seq PLQWSLLVAVVAG/SV

<221> polyA_signal
<222> 708..713

<221> polyT_site
<222> 732..743

<400> 258

^J^cttccgcct gcgcetgcgc agcvcagete cshgagccct gccaaccatg gtgaacttgg 60 
Ul^tctgtcccg ggtggacgac gecgtggctg ccaageaccc ggcaccggc atg gee ttt 118 

Met Ala Phe 

ggc ttg cag atg ttc att cag agg aag ttt cca tac cct ttg cag tgg 166
Gly Leu Gln Met Phe Ile Gln Arg Lys Phe Pro Tyr Pro Leu Gln Trp
-25 -20 -15 -10

agc ctc cta gtg gcc gtg gtt gca ggc tct gtg gtc agc tac ggg gtg 214
Ser Leu Leu Val Ala Val Val Ala Gly Ser Val Val Ser Tyr Gly Val
-5 1 5

acg aga gtg gag tcg gag aaa tgc aac aac ctc tgg ctc ttc ctg gag 262
Thr Arg Val Glu Ser Glu Lys Cys Asn Asn Leu Trp Leu Phe Leu Glu
10 15 20

acc gga cag ctc ccc aaa gac agg agc aca gat cag ara agc 304
Thr Gly Gln Leu Pro Lys Asp Arg Ser Thr Asp Gln Xaa Ser
25 30 35

taggagagct ccagcagggg cacagargat tgggggcagg argartctgg aacacakcct 364
tcatgccccc tgaccccagg ccgaccctcc ccacacccta gggtacccca gtcgtatcct 424
ctgtccgcat gtgtggccag gcctgacaaa cmcctgcaga tggctgctgc cccaacctgg 484
gacctgccca ggaggttgga gcagaaaggg ctctccctgg ggtggtgttt ctcctctagg 544
gtattgggat gcatgttctg cactgccagc agagagggtg tgtctggggg ccaccaccta 604
tgggacacgg ggtcgaaggg gcctgtacac tctgtcattt cctttctagc ccctgcatct 664
ccaacaagtc caaggtgaca gctggtgcta ggggcgtggg gttaataaat ggcttatcct 724
tctctccaaa araaaaaaam c 745 



<210> 259 

<211> 637 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 201..419

<221> sig_peptide
<222> 201..272
<223> Von Heijne matrix
score 6.40000009536743
seq LSYLPLWLGPIWP/CS

<221> polyA_signal
<222> 601..606

<221> polyA_site
<222> 627..637

<400> 259

acaaaatata attgcctcts ccctctccca ttttctctct tgggagcaat ggtcacagtc 60
y^bctggtacct gaaaaggtac ctaggtctag gcccttcttc cctttccctt cctctcccct 120 
ij'kccccagaac tttggctccc tttcccttct ctctctggta gctccaggag gcctgtgatc 180 
cagctccctg cctagcatcc atg acc tgt tgg atg tta cct cca atc agt ttc 233
Met Thr Cys Trp Met Leu Pro Pro Ile Ser Phe
-20 -15

ctg tcc tac ctg cct ctt tgg ctt gga cct ata tgg cca tgc tct ggc 281
Leu Ser Tyr Leu Pro Leu Trp Leu Gly Pro Ile Trp Pro Cys Ser Gly
-10 -5 1

tct acc ctt ggg aag cct gat ccc ggt gtg tgg ccc agc ttg ttc agg 329
Ser Thr Leu Gly Lys Pro Asp Pro Gly Val Trp Pro Ser Leu Phe Arg
5 10 15

ccc tgg gat gct gca tct cca ggc aac tat gca ctt tcc cgg gga rar 377
Pro Trp Asp Ala Ala Ser Pro Gly Asn Tyr Ala Leu Ser Arg Gly Xaa
20 25 30 35

aac cak tat gav aak tgg ggg cag ggc aca cat tca tct ttg 419
Asn Xaa Tyr Xaa Xaa Trp Gly Gln Gly Thr His Ser Ser Leu
40 45

targaaggtc tggcctgggg tcrggtgaag gagggcccag gtcagttctg gggtcccagt 479
gacctgcttt gccattctcc tggtgccgct gctgctccct gtttctggag ctggatgttc 539
cccacctggc agttgagctg cctgagccaa tgtgtctgtc tttggtaact gagtgaacca 599
taataaaggg gaacatttgg ccctgtgaaa aaaaaaaa 637



<210> 260 

<211> 1315 

<212> DNA 

<213> Homo sapiens 
<220>
<221> CDS 
<222> 123. .302 

<221> sig_peptide 

<222> 123. .176 

<223> Von Heijne matrix 

score 4.30000019073486 

seq WTCLKSFPSPTSS/HA 

<221> polyA_signal 
<222> 1279. .1284 

<221> polyA_site 
<222> 1301. .1312 

<400> 260 

aagagcatcc tgcgccccgg cgcggggccc tgcggtagcc tcaggcccct cccctggacc 60 

cgccgcagag ccagtgcaga atacagaaac tgcagccatg accacgcacg tcaccctgga 120 

ag atg ccc tgt cca acg tgg acc tgc ttg aag agc ttc ccc tcc ccg 167
Met Pro Cys Pro Thr Trp Thr Cys Leu Lys Ser Phe Pro Ser Pro
-15 -10 -5

acc agc agc cat gca tcg agc ctc cac ctt cct cca tca tgt acc agg 215
Thr Ser Ser His Ala Ser Ser Leu His Leu Pro Pro Ser Cys Thr Arg

1 5 10 

;''pta act ttg aca caa act ttg agg aca gga atg cat ttg tea egg gca 263 

Leu Thr Leu Thr Gln Thr Leu Arg Thr Gly Met His Leu Ser Arg Ala
15 20 25

ttg caa ggt aca ttg acc agg cta cag tcc act cca gca tgaatgarat 312
Leu Gln Gly Thr Leu Thr Arg Leu Gln Ser Thr Pro Ala

30 35 40

gctggaggaa ggacatgakt atgcggtcat gctgtacacc tggcgcagct gttcccgggc 372

=]pattcceeag gtgaaatgea acragcagee eaaccgakta raratctatg araaracagt 432 

J^raggtgetg gagceggagg tcaceaagct catgaagttc atgtatttte arcgeaagge 492 

pategagcgg ttctgcascg aggtgaagcg getgtgecat geegagcgca ggaaggactt 552 

j;j:gtetctgag geetacctcc tgaeccttgg caagttcatc aaeatgtttg ctgtcctgga 612 

tgagctaaag aacatgaast gcagcgtcaa raatgaccac tctgcctaca agagggcagc 672

j;keagttcctg eggaagatgg eagatcceca gtctatceag gagtegeaga acctttecat 732 

gttcctggcc aaccacaaca ggatcaccca gtgtctccac cagcaacttg aagtgatccc 792

aggctatgag gagctgctgg ctgacattgt caacatctgt gtggattact acgagaacaa 852

gatgtacctg actcccagtg agaaacatat gctcctcaag gtaaaactcc cctgaggccg 912

cacccatgga gcctgggctt accctctcac cttcttctta ttaaaaatcc gttttaaaaa 972

acaatgtttc ttttttctta aacattgata cagatcttac ggcacataat ggtttgtaac 1032

ctgttccttt cctgtaatat aatataccgt agtcaccttt ccagatgtca ttaaggctat 1092

ttctacaatg ttatgtgtaa tgactgccaa gtattctgtt gtattggaac attgtcatgt 1152

aacatatccc ctgtggttgg atatttgcta aacttcattg aacacccttg tagcagtttt 1212

tgtgcacatc tttttgtcaa ggcaaacttc ctagaagaga aattgctggc tcaaagggaa 1272

aaacagaata aatcgttttt tttatttcaa aaaaaaaaaa ccc 1315 



<210> 261 

<211> 1035 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 98..673

<221> sig_peptide
<222> 98..376
<223> Von Heijne matrix
score 5.59999990463257
seq VLLLRQLFAQAEK/WY

<221> polyA_site
<222> 1025..1035

<400> 261 

aattttcygt ggtccaacta ccctcggcga tcccaggctt ggcggggcac cgcctggcct 60 
ctcccgttcc tttaggctgc cgccgctgcc tgccgcc atg gca gag ttg ggc cta 115
Met Ala Glu Leu Gly Leu
-90

aat gag cac cat caa aat gaa gtt att aat tat atg cgt ttt gct cgt 163
Asn Glu His His Gln Asn Glu Val Ile Asn Tyr Met Arg Phe Ala Arg
-85 -80 -75

tca aag aga ggc ttg aga ctc aaa act gta gat tcc tgc ttc caa gac 211
Ser Lys Arg Gly Leu Arg Leu Lys Thr Val Asp Ser Cys Phe Gln Asp
-70 -65 -60

... aag gag agc agg ctg gtg gag gac acc ttc acc ata gat gaa gtc 259
Leu Lys Glu Ser Arg Leu Val Glu Asp Thr Phe Thr Ile Asp Glu Val
-55 -50 -45 -40

tct gaa gtc ctc aat gga tta caa gct gtg gtt cat agt gag gtg gaa 307
Ser Glu Val Leu Asn Gly Leu Gln Ala Val Val His Ser Glu Val Glu
-35 -30 -25

tct gag ctc atc aac act gcc tat acc aat gtg tta ctt ctg cga cag 355
Ser Glu Leu Ile Asn Thr Ala Tyr Thr Asn Val Leu Leu Leu Arg Gln
-20 -15 -10

... ttt gca caa gct gag aag tgg tat ctt aag cta cag aca gac atc 403
Leu Phe Ala Gln Ala Glu Lys Trp Tyr Leu Lys Leu Gln Thr Asp Ile
-5 1 5

tct gaa ctt gaa aac cga gaa tta tta gaa caa ktt gca gaa ttt gaa 451
Ser Glu Leu Glu Asn Arg Glu Leu Leu Glu Gln Xaa Ala Glu Phe Glu
10 15 20 25

aaa gca rav att aca tct tca aac aaa aag ccc atc tta dat gtc aca 499
Lys Ala Xaa Ile Thr Ser Ser Asn Lys Lys Pro Ile Leu Xaa Val Thr
30 35 40

aas cca aaa ctt gct cca ctt aat gaa ggt gga aca gca aaa ctc cta 547
Xaa Pro Lys Leu Ala Pro Leu Asn Glu Gly Gly Thr Ala Lys Leu Leu
45 50 55

aac aag gta ata tgt att att ttg aga aac gga aag tct ctc att ctg 595
Asn Lys Val Ile Cys Ile Ile Leu Arg Asn Gly Lys Ser Leu Ile Leu
60 65 70

tcc tgt cat tgc cta ggg tgg aga aac aaa agt gga agg ttt gtt tca 643
Ser Cys His Cys Leu Gly Trp Arg Asn Lys Ser Gly Arg Phe Val Ser
75 80 85

ggt cct ctg agg ata att agt cca ttg cag tagttttact tgatggtacc 693
Gly Pro Leu Arg Ile Ile Ser Pro Leu Gln
90 95

ccatgggcca gaagagggca tacttaacct tctagagagc ctgaagtagc tcctgatcac 753

accttttcaa ggtaaagtga agagcatgaa attttggaca gcgtttattg atggacattt 813

aaagtttgtg atctgcggta acaaggagaa gggtttttaa gtttataaaa attatttatc 873

aattagccgg gtgtggtggt acgtgcctat agtcagagct actcgggagg ctgaggcagg 933






agaattgctt gaacccggga ggtggaggtt gcagtgagct gagatcacgc cactgcactc 993 
tagcctgggc gacagagcga gactccatct caaaaaaaaa aa 1035 



<210> 262 
<211> 696 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17..463

<221> sig_peptide
<222> 17..232
<223> Von Heijne matrix
score 3.79999995231628
seq LMGLALAVYKCQS/MG

<221> polyA_signal
<222> 657..662

<221> polyA_site
<222> 684..696

<400> 262

actcaaacag attccc atg aat ctc ttc atc atg tac atg gca ggc aat act 52
Met Asn Leu Phe Ile Met Tyr Met Ala Gly Asn Thr

-70 -65

atc tcc atc ttc cct act atg atg gtg tgt atg atg gcc tgg cga ccc 100
Ile Ser Ile Phe Pro Thr Met Met Val Cys Met Met Ala Trp Arg Pro
-60 -55 -50 -45

att cag gca ctt atg gcc att tca gcc act ttc aag atg tta gaa agt 148
Ile Gln Ala Leu Met Ala Ile Ser Ala Thr Phe Lys Met Leu Glu Ser

-40 -35 -30

tca agc cag aag ttt ctt cag ggt ttg gtc tat ctc att ggg aac ctg 196
Ser Ser Gln Lys Phe Leu Gln Gly Leu Val Tyr Leu Ile Gly Asn Leu
-25 -20 -15

atg ggt ttg gca ttg gct gtt tac aag tgc cag tcc atg gga ctg tta 244
Met Gly Leu Ala Leu Ala Val Tyr Lys Cys Gln Ser Met Gly Leu Leu

-10 -5 1

cct aca cat gca tcg gat tgg tta gcc ttc att gag ccc cct gag aga 292
Pro Thr His Ala Ser Asp Trp Leu Ala Phe Ile Glu Pro Pro Glu Arg
5 10 15 20

atg gag tca gtg gtg gag gac tgc ttt tgt gaa cat gag aaa gca gcg 340
Met Glu Ser Val Val Glu Asp Cys Phe Cys Glu His Glu Lys Ala Ala

25 30 35

cct ggt ccc tat gta ttt ggg tct tat tta cat cct tct tta agc cca 388
Pro Gly Pro Tyr Val Phe Gly Ser Tyr Leu His Pro Ser Leu Ser Pro

40 45 50

gtg gct cct cag cat act ctt aaa cta atc act tat gtt aaa aaa aac 436
Val Ala Pro Gln His Thr Leu Lys Leu Ile Thr Tyr Val Lys Lys Asn

55 60 65

caa aaa act ctt ttc tcc atg gtg ggg tgacaggtcc taaaaggaca 483
Gln Lys Thr Leu Phe Ser Met Val Gly
70 75






atgtgcatat tacgacaaac acaaaaaaac tataccataa cccagggctg aaaataatgt 543 

aaaaaacttt atttttgttt ccagtacaga gcaaaacaac aacaaaaaaa cataactatg 603 

taaacaaaaa aataactgct gctaaatcaa aaactgttgc agcatctcct ttcaataaat 663

taaatggttg araacaatgc aaaaaaaaaa aaa 696 



<210> 263 
<211> 868 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 263. .481 

<221> sig_peptide 

<222> 263 . .322 

<223> Von Heijne matrix 

score 11.1999998092651 

seq ILVVLMGLPLAQA/LD

<221> polyA_site
<222> 858. .868

<400> 263

=jkagacacgcc tacgattaga ctcaggcagg cacctaccgg cgagcggccg crvgtgactc 60 
l^caggcgcgg cggtacctca cggtggtgaa ggtcacaggg ttgcagcact cccagtagac 12 0 
caggagctcc gggaggcagg gccggcccca cgtcctctgc gcaccaccct gagttggatc 180
J'btctgtgcgc cacccctgag ttggatccag ggctagctgc tgttgacctc cccactccca 240 
cgctgccctc ctgcctgcag cc atg acg ccc ctg ctc acc ctg atc ctg gtg 292
Met Thr Pro Leu Leu Thr Leu Ile Leu Val

-20 -15

gtc ctc atg ggc tta cct ctg gcc cag gcc ttg gac tgc cac gtg tgt 340
Val Leu Met Gly Leu Pro Leu Ala Gln Ala Leu Asp Cys His Val Cys

-10 -5 1 5

f^^cc tac aac gga gac aac tgc ttc aac ccc atg cgc tgc ccg get atg 388 
iklsi Tyr Asn Gly Asp Asn Cys Phe Asn Pro Met Arg Cys Pro Ala Met 
10 15 20 

gtt gcc tac tgc atg acc acg cgc acc tac tac acc ccc acc agg atg 436
Val Ala Tyr Cys Met Thr Thr Arg Thr Tyr Tyr Thr Pro Thr Arg Met 

25 30 35 

aag gtc agt aag tcc tgc gtg ccc cgc tgc ttc gar nac tgt gta 481
Lys Val Ser Lys Ser Cys Val Pro Arg Cys Phe Glu Xaa Cys Val 

40 45 50 

tgatggctac tccaageacg cgtecaccae etcctgetge cagtacgace tctgeaaegg 541 
caccggectt gceacccegg ccacectggc cctggcccec atectcctgg ccaccctctg 601 
gggtcteete taaagccecc gaggeagace cactcaagaa caaagctetc gagacaeact 661 
getayacect ckcacccakc tcacectgcc tcaccctcca cactcectgc gaectcctca 721 
gecatgecca gggtcaggae tgtgggcaag aagacacccg acctceccca accaecacac 781 
gaccteactt cgaggccttg acctttcgat getgtgtggg atcccaaaag tgtceggett 841 
tgatgggctg atcagcaaaa aaaaaaa 868 



<210> 264 
<211> 775 




<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 42. .299 

<221> sig_peptide

<222> 42. .101 

<223> Von Heijne matrix 

score 5.40000009536743 

seq WFVHSSALGLVLA/PP 

<221> polyA_site 
<222> 762 . .775 



<400> 264 

aacgatacaa atggtaggcc ttcatgtgag ccagtdacta c atg aat ctt cat ttc 56 

Met Asn Leu His Phe 
-20 



cca cag tgg ttt gtt cat tca tca gcg tta ggc ttg gtc ctg gct cca 104
Pro Gln Trp Phe Val His Ser Ser Ala Leu Gly Leu Val Leu Ala Pro

-10 -5 1

cct ttc tcc tct ccg ggc act gac ccc acc ttt ccg tgt att tac tgt 152
Pro Phe Ser Ser Pro Gly Thr Asp Pro Thr Phe Pro Cys Ile Tyr Cys

5 10 15

cta tta aat atg atc atg acc cgc ctt gca ttt tca ttc atc acc 200
Leu Leu Asn Met Ile Met Thr Arg Leu Ala Phe Ser Phe Ile Thr

20 25 30

tta tgc cca aat tta aag gaa gtt tgt ctc att ttg cca gaa aaa 248
Cys Leu Cys Pro Asn Leu Lys Glu Val Cys Leu Ile Leu Pro Glu Lys

35 40 45

aat tgt aat agt cga cac gct gga ttt gta ggg cca sca aaa ttg cgg 296
Asn Cys Asn Ser Arg His Ala Gly Phe Val Gly Pro Xaa Lys Leu Arg

55 60 65

bag tgaaactwkk ttcwcttcta aagcccttca tttcccacaa ggttaagctc 349

ccgaaacccc atttgatcct tggttcctat ttcgatcctc ctttggaatc tgaaaatcgg 409



tctccatgtt gtatgcaaat taaaakttgc cttgtttgtt actcttccaa cacagggtat 469

cagggaraaa gaggccttat ctgttcctcc atcccccctg ttttgacaga ctgctaagaa 529

ttcctcagga cttcctttgg ttggggattt tactttccca aaagtctgat ctgatttctt 589

tcaggggtag acaagcttgt cctagtgctc tgcttcaggt cttatcagaa gaaacccagg 649

aatagaaaag gtagatgcct tgacttttgt ccctgttgtg gggactaaag tgttttttgc 709

cagaattgtc aaaagctccg gttcaaactc tgtagagttt catggaaaaa caaaacaaaa 769

aaaaaa 775 



<210> 265 

<211> 1075 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 198. .431 




<221> sig_peptide

<222> 198. .260 

<223> Von Heijne matrix 

score 6.90000009536743 

seq LLACGSLLPGLWQ/HL 

<221> polyA_site
<222> 1064. .1074 

<400> 265 

atatatttct gaggcagtac ccatctcact tgtaaactta aaagacaccg cagagatttg 60 

^ggg^ctcag aagtcaaata gagtaggtta aaaacctctt atttttcaaa ttaattgttt 120 

taagaaacaa gcatacctgt gtaagtgaaa tatcttaatt tgtgttgaat caagttagga 180 

gacagagatt ctcatga atg tgt cct gtg ttc tca aag cag ctg cta gcc 230

Met Cys Pro Val Phe Ser Lys Gln Leu Leu Ala

-20 -15

tgt ggg tct ctc cta cct ggg tta tgg cag cac ctc aca gcc aat cac 278
Cys Gly Ser Leu Leu Pro Gly Leu Trp Gln His Leu Thr Ala Asn His

-10 -5 1 5

tgg cct cca ttc tcc sct ttc ctc tgt aca gtt tgc tct ggt tcc tca 326
Trp Pro Pro Phe Ser Xaa Phe Leu Cys Thr Val Cys Ser Gly Ser Ser

10 15 20

ag cag att tcc gag tat act gct tca gcc acg ccc cca ctg tgc cgt 374
Gln Ile Ser Glu Tyr Thr Ala Ser Ala Thr Pro Pro Leu Cys Arg

25 30 35

ctg aac caa gag cca ttc gty tca aga gcc att cgt cca aag tac 422
Ser Leu Asn Gln Glu Pro Phe Val Ser Arg Ala Ile Arg Pro Lys Tyr

40 45 50





J,fcct ate ace tagecattgt akecatacca agcegggctt cetaettccc 471 
Ser Ile Thr

55

tctgctcccc ttggtttcct cctgtraart aaatctcact gacccttgat gcasctccaa 531

gcatatataa tatatatata ataaaaccat abtctaaaaa attcaaacca ggawaaataa 591

rasccaraaat ttgtatggga aaaatctgca caaatttatt tggccagcat ggttatcatg 651

gctctattga atttatcctt gaccgtcttt aaagccaaag caaacgggat aaagtgatca 711

■J^ctacttaee tctcaataee aaaaargaag eaggaggcaa aatctctcaw taatttcata 771 

j4aaaeaatte ttakctggge geggtggcte weacctgtar teccaacaet ttgggaggce 831 

Jlbaggtgggeg gateatgagg tegggagatc aamaceatcc tggetaacat ggtgaaaccc 891 

catctctact aaaattacaa aaaattrgct gggcgaggtg gcgggcacct gtggtcccag 951

ctactcggga ggctgaggca agagaatggt gtgaacccca gggggcggag cctgcagtga 1011

gctgagatcg caccactgca ctccagcctg ggcgacagtg agactccgtc tcaaaaaaaa 1071

aaah 1075 



<210> 266 
<211> 981 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 279. .473 

<221> sig_peptide 

<222> 279. .362 

<223> Von Heijne matrix 






score 4.40000009536743 
seq SCFLVALIIWCYL/RE 

<221> polyA_signal 
<222> 944. .949 

<221> polyA_site 
<222> 970. .981 



<400> 266 

agaatcgtgt cttgtgtgcc ccggcggccg ggtgagctcc tcaaggtctc ggagggccga 60 
gggcagacac cggcgggcgg gcggasgctt actgctctct ctcttccagg gccgtccggg 120 
cgctgaggct cataggctgg gcttcccgaa gccttcatcc gttgcccggt tcccgggatc 180 
gggcccaccc tgccgccgag gaagaggacg accctgaccg ccccattgag ttttcctcca 240 
gcaaagccaa ccctcaccgc tggtcggtgg gccatacc atg gga aag gga cat cag 296 

Met Gly Lys Gly His Gln
-25 

cgg ccc tgg tgg aag gtg ctg ccc ctc agc tgc ttc ctc gtg gcg ctg 344
Arg Pro Trp Trp Lys Val Leu Pro Leu Ser Cys Phe Leu Val Ala Leu

-20 -15 -10

atc atc tgg tgc tac ctg agg gag gag agc gag gcg gac cag tgg ttg 392
Ile Ile Trp Cys Tyr Leu Arg Glu Glu Ser Glu Ala Asp Gln Trp Leu
-5 1 5 10

jj'ipga eag gtg tgg gga gag gtg cca gag ccc agt gat cgt tet gag gag 440 
Arg Gln Val Trp Gly Glu Val Pro Glu Pro Ser Asp Arg Ser Glu Glu

20 25 
cct gag act cca gct gcc tac aga gcg aga act tgacggggtg cccgctgggg 493
Pro Glu Thr Pro Ala Ala Tyr Arg Ala Arg Thr
30 35

ctggcaggaa gggagccgac asccgccctt cggatttgat ktcacgtttg cccgtgactg 553
tcctggctat gcktgcgtcc tcagcactra argacttggc tggtggatgg ggcacttggc 613
|,l^atgetgatt cgcgtgaagg eggavcaaaa tetcagcaaa teggaaactg eteetcscct 673 
ggctcttgat ktccaaggat tccatcggca aaacttctca ratccttggg gaaggtttca 733
gttgcactgt atgctgttgg atttgccaag tctttgtata acataatcat gtttccaaag 793
; pacttetggt gacacttgte atccagtgtt agtttgcagg taatttgett tetgagatag 853 
aatatctggc agaagtgtga aactgtattg catgctgcgg cctgtgcaag gaacacttcc 913
•;55%catgtgagt tttaeaeaac aacaaatgaa aataaatttt aattttataa tatgggaaaa 973 
aaaaaaaa 981



<210> 267 

<211> 1031 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 12 . .644 

<221> sig_peptide
<222> 12 . .92 
<223> Von Heijne matrix 
score 4 

seq LTFFSGVYGTCIG/AT 



<221> polyA_signal 




<222> 1002 . .1007 

<221> polyA_site 
<222> 1020. .1031 

<400> 267 

acaccaagga g atg ctc ctt ctt agt att aca act gct tat aca ggt ctg 50

Met Leu Leu Leu Ser Ile Thr Thr Ala Tyr Thr Gly Leu
-25 -20 -15

gaa tta act ttc ttc tct ggt gta tat gga acc tgt att ggt gct aca 98
Glu Leu Thr Phe Phe Ser Gly Val Tyr Gly Thr Cys Ile Gly Ala Thr

-10 -5 1

aat aaa ttt gga gca gaa gag ara agc ctt att gga ctt tct ggc att 146
Asn Lys Phe Gly Ala Glu Glu Xaa Ser Leu Ile Gly Leu Ser Gly Ile

5 10 15

ttc atc ggc att gga gaa att tta ggt gga agc ctc ttc ggc ctg ctg 194
Phe Ile Gly Ile Gly Glu Ile Leu Gly Gly Ser Leu Phe Gly Leu Leu

20 25 30

agc aag aac aat cgt ttt ggt aga aat cca gtt gtg ctg ttg ggc atc 242
Ser Lys Asn Asn Arg Phe Gly Arg Asn Pro Val Val Leu Leu Gly Ile

35 40 45 50

ctg gtg cac ttc ata gct ttt tat cta ata ttt ctc aac atg cct gga 290
Leu Val His Phe Ile Ala Phe Tyr Leu Ile Phe Leu Asn Met Pro Gly

55 60 65

at gcc ccg att gct cct gtt aaa gga act gac agc agt gct tac atc 338
Ala Pro Ile Ala Pro Val Lys Gly Thr Asp Ser Ser Ala Tyr Ile

70 75 80

saaa tcc agc aaa raa ttt gcc att ctc tgc akt ttt ctg tkg ggc ctt 386
Ser Ser Lys Xaa Phe Ala Ile Leu Cys Xaa Phe Leu Xaa Gly Leu

85 90 95

aac agc tgc ttt aat acc cas ctg ctt akt atc tkg ggc ttt ctg 434
Asn Ser Cys Phe Asn Thr Xaa Leu Leu Xaa Ile Xaa Gly Phe Leu

100 105 110

tat tct gaa rac agc gcc cca kca ttt gcc atc ttc aat ttt gtt cag 482
Tyr Ser Glu Xaa Ser Ala Pro Xaa Phe Ala Ile Phe Asn Phe Val Gln

115 120 125 130

tct att tgc gca gcc gtg gca ttt ttc tac agc aac tac ctt ctc ctt 530
Ser Ile Cys Ala Ala Val Ala Phe Phe Tyr Ser Asn Tyr Leu Leu Leu

135 140 145

cac tgg caa ctc ctg gtc atg gtk atw ttt ggg ttt ttk gga aca att 578
His Trp Gln Leu Leu Val Met Val Ile Phe Gly Phe Xaa Gly Thr Ile

150 155 160

tct ttc ttc act gtg gaa tgg gaa sct gcc gcc ttt gta scc cgc ggc 626
Ser Phe Phe Thr Val Glu Trp Glu Xaa Ala Ala Phe Val Xaa Arg Gly

165 170 175

tct gac tac cga agt atc tgatctggtg tccgtgaggg gacacgtatg 674
Ser Asp Tyr Arg Ser Ile
180

acctcagaaa cacagctgga cacagagctt ggtggaagaa gtcgcctttg atcttcacta 734

tatattgggt gatgttcagt atggaaaatc aagggattaa gactgttaaa tcagccagag 794

tkggtgttca agtttacaga tatgagttat ttaaagcaag tagaataagg gaaagctgtt 854

ctgtcaactg taattgttca aagatgttgt ttttcatttc atctatctca attcttataa 914

tcatgttata gaatgtaaat gttttcttct ctctcctgct cttgttggaa gatcctgcct 974

tgatttagaa tactaggcca tatgtcatat aaatattttt tctggaaaaa aaaaaaa 1031 




<210> 268 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91. .459 

<221> sig_peptide

<222> 91. .330 

<223> Von Heijne matrix 

score 7.69999980926514 

seq LVLFLSLALLVTP/TS 

<221> polyA_site 
<222> 1271. .1281 

<400> 268 

tattccttgg agttccacga ctgaattaag actgttgtgg grdccataat tttcaaatac 60 
ttgccctata ttcgtgttga gggttcacac atg agc aca tgg tat ttg gca ctt 114
Met Ser Thr Trp Tyr Leu Ala Leu

-80 -75

at aag tcc tat aag aat aaa gac agc gtt agg att tat ctc agc ttg 162
Lys Ser Tyr Lys Asn Lys Asp Ser Val Arg Ile Tyr Leu Ser Leu

-70 -65 -60

aca gtg agc att aaa ttt aca tac ttt cat gat ata cag act aat 210
Cys Thr Val Ser Ile Lys Phe Thr Tyr Phe His Asp Ile Gln Thr Asn

-55 -50 -45

tgt ctt aca aca tgg aaa cat tcg aga tgc aga ttt tat tgg gca ttt 258
Cys Leu Thr Thr Trp Lys His Ser Arg Cys Arg Phe Tyr Trp Ala Phe

-35 -30 -25

ggt ggt tcc att tta cag cac tca gtg gat ccc ctt gtt ttg ttc cta 306
Gly Gly Ser Ile Leu Gln His Ser Val Asp Pro Leu Val Leu Phe Leu

-20 -15 -10

ctg gcc ctg tta gtg aca ccc act tcc acc cct tct gct aar ata 354
Ser Leu Ala Leu Leu Val Thr Pro Thr Ser Thr Pro Ser Ala Lys Ile

-5 1 5

car agc ctt caa att gac ctc cct gga ggc tgg agg ctg gcc act gac 402
Gln Ser Leu Gln Ile Asp Leu Pro Gly Gly Trp Arg Leu Ala Thr Asp

10 15 20

agg atc ttt acc ctc tcc ccc gta ccc atg gac rgc ccc ctc atc ctt 450
Arg Ile Phe Thr Leu Ser Pro Val Pro Met Asp Xaa Pro Leu Ile Leu

25 30 35 40

cat cag ttg taaaggtaga tatttgttcc ttggagtcca acatcatgct 499
His Gln Leu



gttcagaata taatgagatc aatagttgaa aaactagata tacatgccac ccwgacaaag 559

ctattaagtt attaagtgtc agccctggat cttggcttat tgtgaaatgt taattatttt 619

atcactcyat taagaagctg tgggctccat ctcagcattg aaaagggact aatttgctct 679

gttttggaat tgaattagct ttcaggccas cagggcactg tttggtaaat tgctttttcc 739

agtactagca tgttttctcc ctccatagcc tctgttagct tctgagcttg taacctccag 799

ggaaavatga gaatattcac ccttttaata tgtgtagaga ccatgcaaga ccattgtctt 859

ctaataatta gaaatactta gccagattct ctatagtaaa cccggagatt gggagggctg 919

ctttctactt ggtgcatcct tctgcgcttc taatgatttt taaaaatctg ttaataattg 979

atgttttctg gctgggcaca gtggctcacg cctgtaatcc cagcactttg ggaggccaag 1039

gagggcagat catgaggtca ggagattgar accatcctgg ctaacacggt gaaaccccgt 1099

ctctactaaa aatacaaaar aattakccgg gcatggtagt gggcgcctgt gtacccagct 1159






actggggagg ctgaggcarg araatcgctt gaacctggga ggcggaggtt gcastragct 1219 
gagatggtgc caccgcactc tagcctgggt gacagagcga gacttcattt caaaaaaaaa 1279 
aatnc 1283 



<210> 269 

<211> 1777 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 70 . .327 

<221> sig_peptide

<222> 70. .147 

<223> Von Heijne matrix 

score 9.60000038146973 

seq WLIALASWSWALC/RI 

<221> polyA_signal
<222> 1741. .1746

<221> polyA_site
<222> 1763. .1774

<400> 269

Wtkgcccggttt cgtgcccgcg gccgactgcg casctgtccg cgagtctgag atacttacag 60 
fj'kgagctaca atg gaa aag tec tgg atg ctg tgg aac ttt gtt gaa aga tgg 111 
- Met Glu Lys Ser Trp Met Leu Trp Asn Phe Val Glu Arg Trp 

-25 -20 -15 

I'^pta ata gcc ttg get tea tgg tct tgg get etc tgc egt att tct ctt 159 
Leu Ile Ala Leu Ala Ser Trp Ser Trp Ala Leu Cys Arg Ile Ser Leu
-10 -5 1

"%ta cct tta ata gtg act ttt cat ctg tat gga ggc att ate tta ctt 207 
Leu Pro Leu Ile Val Thr Phe His Leu Tyr Gly Gly Ile Ile Leu Leu
5 10 15 20

ttg tta ata ttc ata tca atw kca ggt att ctg tat aaa ttc cas gat 255
Leu Leu Ile Phe Ile Ser Ile Xaa Gly Ile Leu Tyr Lys Phe Xaa Asp

25 30 35

gta ttg ctt tat ttt ccw kaa cag yya tcc tct tca cgt ctt tat gat 303
Val Leu Leu Tyr Phe Pro Xaa Gln Xaa Ser Ser Ser Arg Leu Tyr Asp

40 45 50

tcc cat gcc cac tgg cmt tcg rca taaaaaaatt ttcatcagaa ccaaagatgg 357

Ser His Ala His Trp Xaa Ser Xaa

55 60
aatacgtctg aatcttattt tgatacgata cactggagac aattcaccct attccccaac 417
tataatttat tttcatggga atgcaggcaa cataggtcac aggttggcca aatgcattac 477
ttatgttggt taacctcaaa gttaaccttt tgctggttga ttatcgagga tatggaaaaa 537
gtgaaggaga agcaagtgaa gaaggactct acttagattc tgaagctgtg ttagactacg 597
tgatgactag acctgacctt gataaaacaa aaatttttct ttttggccgt tccttgggtg 657
garcagtggc tattcatttg gcttctgaaa attcacatag gatttcagcc attatggtgg 717
agaacacatt tttaagcata ccacatatgg ccagcacttt attttcattc tttccgatgc 777
gttaccttcc tttatggtgc tacaaaaata aatttttgtc ctacagaaaa atctctcagt 837
gtagaatgcc ttcacttttc atctctggac tctcagatca attaattcca ccagtaatga 897
tgaaacaact ttatgaactc tccccatctc ggactaagan attagccatt tttccagatg 957




ggactcacaa tgacacatgg cagtgccaag gctatttcac tgcacttgaa cagttcatca 1017 

aagaagtcgt aaagagccat tctcctgaag aaatggcaaa aacttcatct aatgtaacaa 1077 

ttatataatg tttccctttt tgattattgc attgtatttt aatttgtgca gaatgataaa 1137 

gaatgttcct tttagaagtg tgttatgtct gtacctgtct gaagagtgac attaaacttt 1197 

gaaaggactt cactgctcct ttacgatatt ccaaatagtt ttttacattg gaaaaactaa 1257 

ttcttgggat tctttcatac attttcatca aaactttcag tgtgattatg tattcatatc 1317 

ttcagtttaa tatgtcagta taatagatat tgttcaaaag tttcttgttg ctaaagtggt 1377 

gtaatctgtt acacagatga atagctagat gtggaaagag atatgtaaac aagaaacctt 1437 

tgggtattgt ttcttaagta aatattggga caatcatggt aagcaaactt agttctgtaa 1497 

ctgcattttt caccttaaaa gttaaatgaa atgcatgatg gtattttatt ccttgaatta 1557 

tgcaatgcaa cattttacat gtaaatagca ctggtcatat actgatgtat atggttatct 1617 

gggttatatc tatttttatg taaactctat ttttgttttt ggcaagaagt gaaattgaga 1677 

cttatgtgca ggttgccatt gaattttgct ctggtgaatg ctgagatcca gctttttctt 1737 

acaaataaat gggaccctgt tttccaaaaa aaaaaaamcm 1777 



<210> 270 
<211> 970 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 12. .497

<221> sig_peptide
<222> 12. .104
<223> Von Heijne matrix
score 5.5

seq LVGVLWFVSVTTG/PW

<221> polyA_signal
<222> 935. .940

<221> polyA_site
<222> 955. .967

<400> 270 

aggtctccaa g atg gcg gcc gcc tgg ccg tct ggt ccg kct gct ccg gag 50
Met Ala Ala Ala Trp Pro Ser Gly Pro Xaa Ala Pro Glu

-30 -25 -20

gcc gtg acg gcc aga ctc gtt ggt gtc ctg tgg ttc gtc tca gtc act 98
Ala Val Thr Ala Arg Leu Val Gly Val Leu Trp Phe Val Ser Val Thr

-15 -10 -5

aca gga ccc tgg ggg gct gtt gcc acc tcc gcc ggg ggc gag gag tcg 146
Thr Gly Pro Trp Gly Ala Val Ala Thr Ser Ala Gly Gly Glu Glu Ser

1 5 10

ctt aag tgc gag gac ctc aaa gtg gga caa tat att tgt aaa gat cca 194
Leu Lys Cys Glu Asp Leu Lys Val Gly Gln Tyr Ile Cys Lys Asp Pro

15 20 25 30

aaa ata aat gac gct acg caa gaa cca gtt aac tgt aca aac tac aca 242
Lys Ile Asn Asp Ala Thr Gln Glu Pro Val Asn Cys Thr Asn Tyr Thr

35 40 45

gct cat gtt tcc tgt ttt cca gca ccc aac ata act tgt aag gat tcc 290
Ala His Val Ser Cys Phe Pro Ala Pro Asn Ile Thr Cys Lys Asp Ser

50 55 60

agt ggc aat gaa aca cat ttt act ggg aac gaa gtt ggt ttt ttc aag 338
Ser Gly Asn Glu Thr His Phe Thr Gly Asn Glu Val Gly Phe Phe Lys

65 70 75

ccc ata tct tgc cga aat gta aat ggc tat tcc tac aat gag cag tcg 386
Pro Ile Ser Cys Arg Asn Val Asn Gly Tyr Ser Tyr Asn Glu Gln Ser

80 85 90

cat gtc tct ttt tct tgg atg gtt ggg agc aga tcg att tta cct tgg 434
His Val Ser Phe Ser Trp Met Val Gly Ser Arg Ser Ile Leu Pro Trp

95 100 105 110

ata ccc tgc ttt ggg ttt gtt aaa btt tyg cac tgt agg gtt tkg tgg 482
Ile Pro Cys Phe Gly Phe Val Lys Xaa Xaa His Cys Arg Val Xaa Trp

115 120 125

aat tgg gag cct aat tgatttcaty cttatttcaa tgcagattgt tggaccttca 537
Asn Trp Glu Pro Asn

130

aatggaagta gttacattat agattactat ggaaccagac ttacaagact gagtattact 597 

aatgaaacat ttagaaaaac gcaattatat ccataaatat tttttaaaag aaacagattt 657 

gagcctcctt gattttaata gagaacttct agtgtatgga tttaaagatt tctctttttc 717

attcatatac cattttatga gttctgtata attttttgtg gtttttgttt tgttgagtta 777

aagtatatta ttgtgagatt tatttaatag gacttccttt gaaagctgta taatagtgtt 837

tctcgggctt ctgtctctat gagagatagc ttattactct gatactcttt aatcttttac 897

f'i^.aaggeaagt tgccaettgt eatttttgtt tctgaaaaat aaaagtataa ettatteaca 957 

aaaaaaaaaa mms 970



<210> 271
<211> 645
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 90. .383

<221> sig_peptide

<222> 90. .200

<223> Von Heijne matrix

score 4.90000009536743 
seq MLIMLGIFFNVHS/AV 



<221> polyA_signal 
<222> 609. .614 



<221> polyA_site
<222> 632 . .643 

<400> 271 

atctctgccc ccctgcgagg gcatcctggg ctttctccca ccgctttccg agcccgcttg 60
cacctcggcg atccccgact cccttcttt atg gcg tcg ctc ctg tgc tgt ggg 113

Met Ala Ser Leu Leu Cys Cys Gly
-35 -30

ccg aag ctg gcc gcc tgc ggc atc gtc ctc agc gcc tgg gga gtg atc 161
Pro Lys Leu Ala Ala Cys Gly Ile Val Leu Ser Ala Trp Gly Val Ile

-25 -20 -15

atg ttg ata atg ctc gga ata ttt ttc aat gtc cat tcc gct gtg ttg 209
Met Leu Ile Met Leu Gly Ile Phe Phe Asn Val His Ser Ala Val Leu

-10 -5 1

att gag gac gtt ccc ttc acg gag aaa gat ttt gag aac ggc ccc car 257
Ile Glu Asp Val Pro Phe Thr Glu Lys Asp Phe Glu Asn Gly Pro Gln

5 10 15

aac ata tac aac ctt tac rag caa ktc agc tac aac tgt ttc atc gct 305
Asn Ile Tyr Asn Leu Tyr Xaa Gln Xaa Ser Tyr Asn Cys Phe Ile Ala

20 25 30 35

gca ggc ctt tac ctc ctc ctc gga ggc ttc tct ttc tgc caa ktt cgg 353
Ala Gly Leu Tyr Leu Leu Leu Gly Gly Phe Ser Phe Cys Gln Xaa Arg

40 45 50

ctc aat aag cgc aag gaa tac atg gtg cgc tagggccccg gcgcgtttcc 403
Leu Asn Lys Arg Lys Glu Tyr Met Val Arg

55 60











ccgctccagc ccctcctcta tttaaaract ccctgcaccg tktcacccag gtcgcgtccc 463

acccttgccg gcgccctctg tgggactggg tttcccgggc rararactga atcccttctc 523

ccatctctgg catccggccc ccgtggarar ggctgaggct ggggggctgt tccgtctctc 583

cacccttcgc tgtgtcccgt atctcaataa agagaatctg ctctcttcaa aaaaaaaaaa 643

my 645 



<210> 272

<211> 773

<212> DNA

<213> Homo sapiens

<220>
<221> CDS
<222> 332. .541

<221> sig_peptide

<222> 332. .376

<223> Von Heijne matrix

score 3.59999990463257
seq FLPCCLLWSVFNP/ES

<221> polyA_signal
<222> 739. .744



<221> polyA_site 
<222> 761. .773 



<400> 272 

aaaacaattc atgcctttca tagtttatta ttattaaagt ctaaacaaaa ttgcaatttc 60
ttaggtaacc ttatatttac aataaatgaa gattaccctc aaatgctaga agctgtctag 120
gtccgtccgg tgtgtcagat tttcctcaga ttagatgtgc caataaccaa gtttattcag 180
taaacaactt gtacttgttt catctggttt tattactctc acccataaac agtaatgact 240
ctctgaccct ctggaaatat gtaatgcttc caatcttgct ttgtgtatct catttaattt 300
gttataaggt agtactgatt ttagcatatt a atg cga ttt ctt cct tgt tgt 352

Met Arg Phe Leu Pro Cys Cys
-15 -10
ttg ctt tgg tct gtg ttc aat cca gag agc tta aat tgt cat tat ttt 400
Leu Leu Trp Ser Val Phe Asn Pro Glu Ser Leu Asn Cys His Tyr Phe

-5 1 5
ghk ndd gaa amc tgt att ttt gyt agt tta caa tat tat gaa att tca 448
Xaa Xaa Glu Xaa Cys Ile Phe Xaa Ser Leu Gln Tyr Tyr Glu Ile Ser
10 15 20






ctt cag gag aaa ctg ctg ggc ttc ctg tgg ctt tgt ttt ctt agt tac 496 

Leu Gln Glu Lys Leu Leu Gly Phe Leu Trp Leu Cys Phe Leu Ser Tyr

25 30 35 40

ttt ttc cgt gcc gtg tat ttt tta att gat ttt tct tct ttt act 541

Phe Phe Arg Ala Val Tyr Phe Leu Ile Asp Phe Ser Ser Phe Thr

45 50 55 

tgaaaagaaa gtgttttatt ttcaaatctg gtccatattt acattctagt tcagagccaa 601 
gccttaaact gtacagaatt tccactgtaa ttaaaactat ttagtgttag ttataaatag 661 
ccttcaaaaa gagagattct ccattacacg atcacctgca tcacagccca tggtgaatgt 721 
atgtttctgc atagcgaaat aaaaatggca aatgcactga aaaaaaaaaa aa 773 



<210> 273 
<211> 566 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 43. .222 

<221> sig_peptide

<222> 43. .177

<223> Von Heijne matrix

score 4

seq ENFLSLLSKSCSA/DP

<221> polyA_signal
<222> 530. .535

<221> polyA_site
<222> 555. .566

<400> 273

';%acgagtgga ggtgtggcta gtggctgtga tgagataaat cc atg cat age ctt 54 

Met His Ser Leu 

-45
ttc att gcg agc ttg aaa gtt ctt ttc tat tac agt ttt agc ttt agg 102
Phe Ile Ala Ser Leu Lys Val Leu Phe Tyr Tyr Ser Phe Ser Phe Arg

-40 -35 -30

ttt aat tgg ttc gac tgc ctt ctc cac aat ttg ggc gag aat ttc ctt 150
Phe Asn Trp Phe Asp Cys Leu Leu His Asn Leu Gly Glu Asn Phe Leu
-25 -20 -15 -10

agc ctt ctc agc aaa agt tgt tct gcg gac ccg tct ggg tca act ttc 198
Ser Leu Leu Ser Lys Ser Cys Ser Ala Asp Pro Ser Gly Ser Thr Phe

-5 1 5
atg agg gac att gag aca aac aaa tgaaatatgg gttaaagtac tctgagcagc 252
Met Arg Asp Ile Glu Thr Asn Lys

10 15
tacaaaaaga araccagtct atcctgctgg agacagtggc cacgtgaara aagagctctt 312
gcagtatgaa agaccacatg gaaagagagg ccacatggaa ccaacagtca gcatcttggt 372
ttcggacacg tgaaraaatt catctcarac tgtgtatcct aaatcaggca cttgctgaat 432
ctaactacat gagtgagacc agttgacaac acatggagca racatgagct gttctcagtg 492
artcctacac aaattcctga ctcacaacac tgtgagcaat aaaatggttg ttattttaag 552
ccaaaaaaaa aaaa 566






<210> 274 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .231 



<221> sig_peptide 
<222> 115. .180 
<223> Von Heijne matrix 
score 5 

seq HLFVTWSSQRALS/HP



<221> polyA_signal 
<222> 419. .424 



<221> polyA_site 
<222> 445. .455

<400> 274

';*kacctgccag tkatgcaaat gccaaaatgt gggtcatcat atagtatatt tgaaaccttt 60 
'^Jbtgaacatgt acaccaccca atgctagagg ctgacttgga aaccggtggg tgca atg 117 

ccc gag gct gtg gaa caa tca gcc cat ctc ttt gtg acc tgg agc agt 165
Pro Glu Ala Val Glu Gln Ser Ala His Leu Phe Val Thr Trp Ser Ser
-20 -15 -10

cag agg gcc ctc agt cac ccc gcc cca ttc ctc acc ara raa aar aat 213
Gln Arg Ala Leu Ser His Pro Ala Pro Phe Leu Thr Xaa Xaa Lys Asn
-5 1 5 10

cca ttt cta tgg aag ctc tgacgtaact tcagtgtttt ctacaatact 261
Pro Phe Leu Trp Lys Leu

J|;bctcctgccc cgccccatta aaacagttct tttgttaaaa aatavcctaa tggtccaact 321 
ttgctgtctg ttcttccaaa tgtttataat acacattatt tataaatatg tctgtttggg 381
aagctaagaa caagctagtt tttacaacac aaatggaaat aaatgcaatt attataaaaa 441
tycaaaaaaa aaaa 455 



<210> 275 
<211> 673 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 232. .384 

<221> sig_peptide 

<222> 232 . .300 

<223> Von Heijne matrix 

score 3.70000004768372 
seq FFLCAAFPLGAGV/KM 






<221> polyA_signal 
<222> 650. .655 

<221> polyA_site
<222> 662 . .673 



<400> 275 

atttggcttg cagactgcct tctatcccag aacagctgag aaatctatga agctgagatt 60 
ctgaaggacc cagcttaggt tcttccactt aggcctcaat tcccttcctt ttccaggggc 120 
agccttagtt tcccatggcc ctgaaacaca cacatttccc ccttcctttc ccagaagcca 180 
ctggcccccc atagcaccca gtgcatcctt tttacaagtg gaagaactag g atg gct 237

Met Ala

ttc caa agt ctt cta gaa atg aag ttc ttt ctc tgt gca gct ttc ccc 285
Phe Gln Ser Leu Leu Glu Met Lys Phe Phe Leu Cys Ala Ala Phe Pro

-20 -15 -10

ctt gga gca gga gtg aag atg ttt cat tat ctt ggg cct ggg aaa cca 333
Leu Gly Ala Gly Val Lys Met Phe His Tyr Leu Gly Pro Gly Lys Pro
-5 1 5 10

ctt cyy cag gct tct ccc tcc ccc cac ccc cat agg amc agg att tgg 381
Leu Xaa Gln Ala Ser Pro Ser Pro His Pro His Arg Xaa Arg Ile Trp
15 20 25

cct tagcttctgg gcctatcsgc tgccttccct cttyttccta ccacctcttc 434

f4:gccttectt trawctctgt tgggettggg gatcttagtt ttcttttgtt tatttcceat 494 

;'J;tcatttttt tcttctggte agttttttta agggggggtg ttgtggtttt ttgtttttgt 554 

tttgcttctg aaaaarcatt tgcctttctt cctctcccaa cataacaatc gtggtaacag 614

aatgcgactg ctgatttacc gatgtattta atgtaagtaa aaaaaggaaa aaaaraaaa 673



<210> 276

<211> 639

<212> DNA

<213> Homo sapiens

<220>
<221> CDS

<222> 143. .427

<221> sig_peptide
<222> 143. .286
<223> Von Heijne matrix
score 7.5

seq FVILLLFIFTVVS/LV

<221> polyA_signal 
<222> 606 . .611 

<221> polyA__site 
<222> 628 . .639 

<400> 276 

aatcgcttca gcagcatcct ctcagacaag agccactatt tctgattcag atcacctgtc 60
atcgaagttt aaagaagggg aaacaggaga cagaaataca ctgaaccaaa aagattcaaa 120
agagcaagtg gaatctctaa ga atg gct tcc agc cac tgg aat gaa acc act 172

Met Ala Ser Ser His Trp Asn Glu Thr Thr

-45 -40
acc tct gtt tat cag tac ctt ggt ttt caa gtt caa aaa att tac cct 220

Thr Ser Val Tyr Gln Tyr Leu Gly Phe Gln Val Gln Lys Ile Tyr Pro

-35 -30 -25

ttc cat gac aac tgg aac act gcc tgc ttt gtc atc ctg ctt tta ttt 268
Phe His Asp Asn Trp Asn Thr Ala Cys Phe Val Ile Leu Leu Leu Phe

-20 -15 -10

ata ttt aca gtg gta tct tta gtg gtg ctg gct ttc ctt tat gaa gtg 316
Ile Phe Thr Val Val Ser Leu Val Val Leu Ala Phe Leu Tyr Glu Val

-5 1 5 10

ctt gam wgc tgc tgc tgt gta aaa aac aaa acc gtg aaa gac ttg aaa 364
Leu Xaa Xaa Cys Cys Cys Val Lys Asn Lys Thr Val Lys Asp Leu Lys

15 20 25

agt gaa ccc aac cct ctt ara akt atg atg gac aac atc aga aaa cgt 412
Ser Glu Pro Asn Pro Leu Xaa Xaa Met Met Asp Asn Ile Arg Lys Arg

30 35 40 

gaa act gaa gtg gtc taacactcta taraaaatga acaaaatctc tgaaagcagc 467 
Glu Thr Glu Val Val 
45 

tcaacctctt ctgaraaaaa aaatatattc tgaggccaac tgttgctaca aaacaaattc 527 
tgactgaatg gttaaaacat ttctagtara aggggaaaaa aaakttaaac atgcactgtt 587 
^^4:gtgtgtata sccatttcat taaatataca gtaaaactyc aaaaaaaaaa aa 639 



<210> 277
<211> 772
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 284..463

<221> sig_peptide
<222> 284..379
<223> Von Heijne matrix
score 3.79999995231628
seq TFINITLWLGSLC/QR

<221> polyA_site
<222> 762..772

<400> 277 

acagctgggg ctttgtcttc tttattgcta ggagaatgta gcaatagaag ttctcatcgc 60 
cctgtattgc acttttggtt ttaaggactg gacccagagt tcctgaaagc caaactccat 120
aagctgctca gtaagttcca agcacatagc cggctkhggg atgcgattcg gtcgaggtct 180 
gttgaatgaa ggtagacgca gcaggcagtt tgtccttacc agtgacctgg aagacggtgg 240 
cacttcctga gtgagctcac ttaccttccc tgaatggtga ggc atg gat gaa tat 295 

Met Asp Glu Tyr 
-30 

tec tgg tgg tgc eac gtg tta gag gtg gta aag ggt caa atg ttt act 343 
Ser Trp Trp Cys His Val Leu Glu Val Val Lys Gly Gin Met Phe Thr 

-25 -20 -15 

ttt att aat att aca tta tgg ctt ggt tct ctg tgt cag ega ttt ttc 3 91 

Phe lie Asn lie Thr Leu Trp Leu Gly Ser Leu Cys Gin Arg Phe Phe 
-10 -5 1 






tat gcc teg ggt act tat ttc eta ata tat ate age aca gta aeg cct 439 
Tyr Ala Ser Gly Thr Tyr Phe Leu lie Tyr lie Ser Thr Val Thr Pro 
5 10 15 20 

age tgg agg ctt tgt ctt gtt agt tgataaatta gtggtaacag gtagatttgg 493 
Ser Tirp Arg Leu Cys Leu Val Ser 
25 

ttacctceca aagtgctggg attrcagacg tgagecaccg cgcetggccg aaacaattct 553 
tttgaaagag agaagtctcc etgtgttgeg eaggctggtc tcagaetcct ggggteaagt 613 
gagcctcetg ctttcgcete ctaaagtgct gggattacag gcgtgagcea ccgcacecgg 673 
acagatgtgt tgattttaaa gtgggtatga ggcctgagcc etggagtttg agaeeagcct 733 
ggacaaeatg gcaagaccct gtctctccaa aaaaaaaaa 772 



<210> 278 
<211> 840 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 162..671

<221> sig_peptide
<222> 162..398
<223> Von Heijne matrix
score 4.09999990463257
seq QGVLFICFTCARS/FP

<221> polyA_signal
<222> 805..810

<221> polyA_site
<222> 830..840

<400> 278

aaaaactgag gcctgggagc aggaacctgt aggcagcgct tgagggtagc gggatagcag 60
Cil;fc;tgcaacgeg cgtgggagge gggggctctg ggcggaacaa aaatcacagg atgtcagagg 120 
atgtttcceg ggaagaactg ggataaaggg gtcccagcac c atg gag gac ccg aae 176 

Met Glu Asp Pro Asn 
-75 

cct gaa gag aac atg aag cag cag gat tea ccc aag gag aga agt ccc 224 
Pro Glu Glu Asn Met Lys Gin Gin Asp Ser Pro Lys Glu Arg Ser Pro 

-70 -65 -60 

cag age cca gga gge aac ate tgc eae ctg ggg gcc ccg aag tgc ace 272 
Gin Ser Pro Gly Gly Asn lie Cys His Leu Gly Ala Pro Lys Cys Thr 

-55 -50 -45 

ege tgc etc ate acc ttc gca gat tec aag ttc cag gag egt cac atg 320 
Arg Cys Leu He Thr Phe Ala Asp Ser Lys Phe Gin Glu Arg His Met 

-40 -35 -30 

aag egg gag cac cca gcg gac ttc gtg gee cag aag ctg cag ggg gtc 368 
Lys Arg Glu His Pro Ala Asp Phe Val Ala Gin Lys Leu Gin Gly Val 

-25 -20 -15 

etc ttc ate tgc ttc acc tgc gee ege tee ttc ccc tec tec aaa gee 416 
Leu Phe He Cys Phe Thr Cys Ala Arg Ser Phe Pro Ser Ser Lys Ala 
-10 -5 15 

ckr rkc ace cac ear ege age eae ggt cca rec gcc aag ccc acc ctg 464 






Xaa Xaa Thr His Gin Arg Ser His Gly Pro Xaa Ala Lys Pro Thr Leu 

10 15 20 

ccg gtt gca acc act act gcc car ccc acc ttc cct tgt cct gac tgt 512 
Pro Val Ala Thr Thr Thr Ala Gin Pro Thr Phe Pro Cys Pro Asp Cys 

25 30 35 

ggc aaa acc ttt ggg cag get gtt tct ctg arg egg cac esc caa atr 560 
Gly Lys Thr Phe Gly Gin Ala Val Ser Leu Xaa Arg His Xaa Gin Xaa 

40 45 50 

cat gar gtc cgt gcc cct cct ggc acc ttc gcc tgc aca rad tgc ggt 608 
His Glu Val Arg Ala Pro Pro Gly Thr Phe Ala Cys Thr Xaa Cys Gly 
55 60 65 70 

cag gac ttt get car gaa rca ggg ctg cat caa cac tac att egg cat 656 
Gin Asp Phe Ala Gin Glu Xaa Gly Leu His Gin His Tyr lie Arg His 

75 80 85 

gcc egg ggg gga etc tgagtteagc ttaagcctct ceaeggtgac gggtggctct 711 
Ala Arg Gly Gly Leu 

90 

gtggctggta ggactcacee atgatatggg gtgeaggaac tctgggggce etgaaggatt 771 
tgcttccctc ecetgggaag gcagagggct cttaataaag aggaeeeaka agattcttaa 831 
aaaaaaaaa 840 



<210> 279
<211> 840
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..632

<221> sig_peptide
<222> 63..308
<223> Von Heijne matrix
score 4.40000009536743
seq NLPHLQVVGLTWG/HI

<221> polyA_signal
<222> 808..813

<221> polyA_site
<222> 829..840

<400> 279

aacttccggt cgcgccascg cccgttgcea gttctgegcg tgteetgcat etccagtatg 60 
ga atg tat gtd tgg eee tgt get gtg gtc etg gee cag tac ett tgg 107 
Met Tyr Val Trp Pro Cys Ala Val Val Leu Ala Gin Tyr Leu Trp 
-80 -75 -70 

ttt cac aga aga tct etg cca ggc aag gee ate tta gag att gga get 155 
Phe His Arg Arg Ser Leu Pro Gly Lys Ala lie Leu Glu He Gly Ala 
-65 -60 -55 

gtg age ctt cca gga att ttg get gee aaa tgt ggt gca gaa gta 203 
Gly Val Ser Leu Pro Gly He Leu Ala Ala Lys Cys Gly Ala Glu Val 

-50 -45 -40 

ata ctg tea gac age tea gaa ctg cct cac tgt ctg gaa gtc tgt egg 251 
He Leu Ser Asp Ser Ser Glu Leu Pro His Cys Leu Glu Val Cys Arg 






-35 -30 -25 -20 

caa age tgc caa atg aat aac ctg cca cat ctg cag gtg gta gga eta 299 

Gin Ser Cys Gin Met Asn Asn Leu Pro His Leu Gin Val Val Gly Leu 

-15 -10 -5 

aca tgg ggt cat ata tct tgg gat ctt ctg got eta cca cca caa gat 347 

Thr Trp Gly His lie Ser Trp Asp Leu Leu Ala Leu Pro Pro Gin Asp 

15 10 

att ate ctt gea tct gat gtg tte ttt gaa eea gaa rat ttt gaa gac 395 

He He Leu Ala Ser Asp Val Phe Phe Glu Pro Glu Xaa Phe Glu Asp 

15 20 25 

att ttg get aea ata tat ttt ttg atg eae aar aat eec aag gte caa 443 

He Leu Ala Thr lie Tyr Phe Leu Met His Lys Asn Pro Lys Val Gin 
30 35 40 45 

ttg tgg tct act tat caa gtt agg art get gac tgg tea ett gaa get 491 

Leu Trp Ser Thr Tyr Gin Val Arg Xaa Ala Asp Trp Ser Leu Glu Ala 

50 55 60 

tta etc tae aaa tgg gat atg aaa tgt gte eac att cet ctt gag tct 539 

Leu Leu Tyr Lys Trp Asp Met Lys Cys Val His He Pro Leu Glu Ser 

65 70 75 

ttt gat gea gac aaa gaa rat ata gea gaa tct ace ctt eea gga aga 587 

Phe Asp Ala Asp Lys Glu Xaa He Ala Glu Ser Thr Leu Pro Gly Arg 

cat aca gtt gaa atg ctg gtc att tcc ttt gca aag gac agt ctc 632
His Thr Val Glu Met Leu Val Ile Ser Phe Ala Lys Asp Ser Leu

';'.|"Jtgaattatae etaeaacctg ttctgggaea gtatcaatae tgatgagcaa cctggeaeae 692 

^;^^kaaetatgag cagaccaett eagettgaga atgcagtggg tetgaagatg gteaagtctg 752 

i'^tttgecttar attttgatgt cacctagaea aeaettaaac teatatgaaa caaaaattaa 812 

^iii^ataegtatt aeaageaaaa aaaaaaaa 840 



<210> 280
<211> 849
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..362

<221> sig_peptide
<222> 21..200
<223> Von Heijne matrix
score 4.80000019073486
seq LVILSLKSQTLDA/ET

<221> polyA_signal
<222> 821..826

<221> polyA_site
<222> 838..849

<400> 280 

agtaagtcce eccgcetege atg atg get geg gtg ceg eeg gge etg gag eeg 53 

Met Met Ala Ala Val Pro Pro Gly Leu Glu Pro 
-60 -55 -50 






tgg aac cgt gtg aga ate cct aag gcg ggg aac cgc age gca gtg aca 101 
Trp Asn Arg Val Arg lie Pro Lys Ala Gly Asn Arg Ser Ala Val Thr 

-45 -40 -35 

gtg cag aac ccc ggc gcg gcc ctt gac ctt tgc att gca get gta att 149 
Val Gin Asn Pro Gly Ala Ala Leu Asp Leu Cys lie Ala Ala Val lie 

-30 -25 -20 

aaa gaa tgc cat etc gtc ata ctg teg etg aag age caa ace tta gat 197 
Lys Glu Cys His Leu Val lie Leu Ser Leu Lys Ser Gin Thr Leu Asp 

-15 -10 -5 

gca gaa aca gat gtg tta tgt gca gtc ctt tac age aat cac aac aga 245 
Ala Glu Thr Asp Val Leu Cys Ala Val Leu Tyr Ser Asn His Asn Arg 

15 10 15 

atg ggc cgc cac aaa ccc cat ttg gcc etc aaa cag gtt gag caa tgt 2 93 

Met Gly Arg His Lys Pro His Leu Ala Leu Lys Gin Val Glu Gin Cys 

20 25 30 

tta aag cgt ttg aaa aac atg aat ttg gag ggc tea att caa gac ctg 341 
Leu Lys Arg Leu Lys Asn Met Asn Leu Glu Gly Ser lie Gin Asp Leu 

35 40 45 

ttt gag ttg ttt tct tee aag taagtaagtg gtccarttgc tttgtgatgt 3 92 

Phe Glu Leu Phe Ser Ser Lys 
50 

jpgtgggctgg gaactcaatg tettgtgatc kcccttwgga ttketctakg ctygckgttg 452 
Siigaatataacc aattataccw cagctgtaka aatwttgttt taatgtgggg taccyggtgt 512 
^"jktgtggtaat ettctgacat tgatetatgg gartgactgg tgtgacattg aaatctgggt 572 
^:^catggtagat tatattaaaa catcagtggg ctgttattgt gcttaactac ctcaagttga 632 
'l^gcttaaagca agtcttcact tgaaaactge tatagaaatg ctttatattt aaaaatgaaa 6 92 
J^^gtaatgggar mttgcacata getgaaaatg tgaagggtcg cecagggagg amatggaage 752 
^^tctgtgcttc ttctgecata ccttgeccta tgcatctctt tgtttcaatc ctttgtcata 812 
^jfccctttataa taaactggta aatgtaaaaa aaaaaaa 849 



<210> 281
<211> 1344
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..503

<221> sig_peptide
<222> 21..344
<223> Von Heijne matrix
score 5.30000019073486
seq ACMTLTASPGVFP/SL

<221> polyA_signal
<222> 1305..1310

<221> polyA_site
<222> 1330..1341

<400> 281 

aaacaactcc ggaaagtaca atg acc agc ggg cag gcc cga gct tcc wyc cag 53
Met Thr Ser Gly Gln Ala Arg Ala Ser Xaa Gln
-105 -100

tcc ccc cag gcc ctg gag gac tcg ggc ccg gtg aat atc tca gtc tca 101
Ser Pro Gln Ala Leu Glu Asp Ser Gly Pro Val Asn Ile Ser Val Ser
-95 -90 -85

atc acc cta acc ctg gac cca ctg aaa ccc ttc gga ggg tat tcc cgc 149
Ile Thr Leu Thr Leu Asp Pro Leu Lys Pro Phe Gly Gly Tyr Ser Arg
-80 -75 -70

aac gtc acc cat ctg tac tca acc atc tta ggg cat cag att gga ctt 197
Asn Val Thr His Leu Tyr Ser Thr Ile Leu Gly His Gln Ile Gly Leu
-65 -60 -55 -50

tca ggc agg gaa gcc cac gag gag ata aac atc acc ttc acc ctg cct 245
Ser Gly Arg Glu Ala His Glu Glu Ile Asn Ile Thr Phe Thr Leu Pro
-45 -40 -35

aca gcg tgg agc tca gat gac tgc gcc ctc cac ggt cac tgt gag cag 293
Thr Ala Trp Ser Ser Asp Asp Cys Ala Leu His Gly His Cys Glu Gln
-30 -25 -20

gtg gta ttc aca gcc tgc atg acc ctc acg gcc agc cct ggg gtg ttc 341
Val Val Phe Thr Ala Cys Met Thr Leu Thr Ala Ser Pro Gly Val Phe
-15 -10 -5

ccg tca ctg tac agc cac cgc act gtg ttc ctg aca cgt aca gca acg 389
Pro Ser Leu Tyr Ser His Arg Thr Val Phe Leu Thr Arg Thr Ala Thr
1 5 10 15

cca cgc tct ggt aca aga tct tca caa ctg cca gag atg cca aca caa 437
Pro Arg Ser Gly Thr Arg Ser Ser Gln Leu Pro Glu Met Pro Thr Gln
20 25 30

aat acg ccc aaa att aca atc ctt tct ggt gtt ata agg ggg cca ttg 485
Asn Thr Pro Lys Ile Thr Ile Leu Ser Gly Val Ile Arg Gly Pro Leu
35 40 45

gaa aag tct atc atg ctt taaatcccaa gcttacagtg attgttccag 533
Glu Lys Ser Ile Met Leu
50

;i; atgatgaccg ttcattaata aatttgcate tcatgcacac cagttaettc ctctttgtga 593 

l^ditggtgataae aatgttttge tatgctgtta tcaagggeag acetagcaaa ttgegtcaga 653 

flpeaateetga attttgtcee gagaaggtgg ctttggctga ageetaattc eaeagctcet 713 

Lif,tgttttttga gagagactga gagaaccata atccttgeet gctgaaccca geetgggcct 773 

H^ggatgctetg tgaataeatt atettgcgat gttgggttat tccageeaaa gaeattteaa 833 

j!ptgeetgtaa etgatttgta eatatttata aaaatetatt cagaaattgg teeaataatg 8 93 

^Jjziacgtgettt geeetgggta eagecagage ecttcaaeec cacettggac ttgaggacet 953 

^-^4cctgatggg aegtttccae gtgtctctag agaaggatcc tggatctage tggtcaegae 1013 

gatgttttca ccaaggtcac aggagcattg egtcgctgat ggggttgaag tttggtttgg 1073 

ttettgttte agcccaatat gtagagaaca tttgaaaeag tctgeaectt tgatacggta 1133 

ttgcatttec aaagccacca atccattttg tggattttat gtgtctgtgg cttaataatc 1193 

atagtaaeaa eaataatace tttttetcea ttttgettgc aggaaaeata cettaagttt 1253 

tttttgtttt gtttttgttt ttttgttttt tgttttcett tatgaagaaa aaataaaata 1313 

gteaeatttt aatacyaaaa aaaaaaaame h 1344 



<210> 282
<211> 671
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 1..201

<221> sig_peptide
<222> 1..63
<223> Von Heijne matrix
score 5.09999990463257
seq LLLKIWLLQRPES/QE

<221> polyA_signal
<222> 637..642

<221> polyA_site
<222> 660..671



<400> 282 



atg ctg gga ggt gac cat agg gct ctg ctt tta aag ata tgg ctg ctt 48
Met Leu Gly Gly Asp His Arg Ala Leu Leu Leu Lys Ile Trp Leu Leu
-20 -15 -10

caa agg cca gag tca cag gaa gga ctt ctt cca ggg aga tta gtg gtg 96
Gln Arg Pro Glu Ser Gln Glu Gly Leu Leu Pro Gly Arg Leu Val Val
-5 1 5 10

atg gag agg aga gtt aaa aat gac ctc atg tcc ttc ttg tcc acg gtt 144
Met Glu Arg Arg Val Lys Asn Asp Leu Met Ser Phe Leu Ser Thr Val
15 20 25

4:tg ttg agt ttt cac tct tct aat gca agg gtc tca cac tgt gaa cca 192
Leu Leu Ser Phe His Ser Ser Asn Ala Arg Val Ser His Cys Glu Pro
30 35 40

ctt agg atg tgatcacttt caggtggcca ggaatgttga atgtctttgg 241
Leu Arg Met
45

4'5:tcagtteat ttaaaaaaga tatctatttg aaagttctea rarttgtaca tatgttteac 3 01 

y,kgtacaggat etgtacataa aagtttcttt ectaaaccat tcaccaagag ceaatatcta 361 

Ij'ggcattttct tggtagcaca aattttctta ttgettaraa aattgtccte cttgttattt 421 

etgtttgtaa racttaagtg agttaggtct ttaaggaaag caacgctcet ctgaaatget 481 

^.■itgtctttttt ctgttgcega aatarctggt ectttttcgg gagttaratg tatarartgt 541 

=^nttgtatgtaa acatttcttg taggcatcac catgaacaaa gatatatttt ctatttattt 601 

J !;attatatgtg eacttcaaga agtcactgte agagaaataa agaattgtct taaatgteaa 661 

J'aaaaaaaaaa 671 



<210> 283
<211> 1601
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 39..1034

<221> sig_peptide
<222> 39..134
<223> Von Heijne matrix
score 6.09999990463257
seq LPLLTSALHGLQQ/QH

<221> polyA_signal
<222> 1566..1571

<221> polyA_site
<222> 1587..1597



<400> 283 

agccccagat cctgaaggag gtgcagagcc cagagggg atg ate keg ctg agg gac 56 

Met lie Xaa Leu Arg Asp 
-30 

aca get gcc tec etc egc ctt gag aga gac aca agg cag ttg eca ctg 104 
Thr Ala Ala Ser Leu Arg Leu Glu Arg Asp Thr Arg Gin Leu Pro Leu 

-25 -20 -15 

etc ace agt gee ctg cac gga ctg cag cag eag cac eca gee ttc tct 152 
Leu Thr Ser Ala Leu His Gly Leu Gin Gin Gin His Pro Ala Phe Ser 
-10 -5 15 

ggt gtg gca egg ctg gee aag egg tgg gtg cgt gee eag ctt ctt ggt 2 00 

Gly Val Ala Arg Leu Ala Lys Arg Trp Val Arg Ala Gin Leu Leu Gly 

10 15 20 

gag ggt ttc get gat gag age ctg gat ctg gtg gcc get gcc ctt ttc 248 
Glu Gly Phe Ala Asp Glu Ser Leu Asp Leu Val Ala Ala Ala Leu Phe 

25 30 35 

ctg cac cet gag cec ttc acc cct ccg agt tec ccc cag gtt ggc ttc 296 
Leu His Pro Glu Pro Phe Thr Pro Pro Ser Ser Pro Gin Val Gly Phe 
40 45 50 

ctt cga ttc ctt ttc ttg gta tca acg ttt gat tgg aag aac aac ccc 344
Leu Arg Phe Leu Phe Leu Val Ser Thr Phe Asp Trp Lys Asn Asn Pro
55 60 65 70

ctc ttt gtc aac ctc aat aat gag ctc act gtg gag gag cag gtg gar 392
Leu Phe Val Asn Leu Asn Asn Glu Leu Thr Val Glu Glu Gln Val Glu
75 80 85

atc cgc agt ggc ttc ctg gca gct cgg gca cag ctc ccc gtc atg gtc 440
Ile Arg Ser Gly Phe Leu Ala Ala Arg Ala Gln Leu Pro Val Met Val
90 95 100

att gtt acc ccc caa rac cgc aaa aac tct gtg tgg aca cag gat gga 488
Ile Val Thr Pro Gln Xaa Arg Lys Asn Ser Val Trp Thr Gln Asp Gly
105 110 115

ccc tca gcc car atc ctg cag cag ctt gtg gtc ctg gca gct gaa scc 536
Pro Ser Ala Gln Ile Leu Gln Gln Leu Val Val Leu Ala Ala Glu Xaa

120 125 130

ctg ccc atg tta rar aas cag ctc atg gat ccc cgg gga cct ggg gac 584
Leu Pro Met Leu Xaa Xaa Gln Leu Met Asp Pro Arg Gly Pro Gly Asp
135 140 145 150

ate agg aca gkg ttc egg ceg ccc ttg gac att tac gac gtg ctg att 632 
lie Arg Thr Xaa Phe Arg Pro Pro Leu Asp He Tyr Asp Val Leu He 

155 160 165 

egc ctg tct cet egc cat ate ccg egg cac egc eag get gtg gac ter 680 
Arg Leu Ser Pro Arg His He Pro Arg His Arg Gin Ala Val Asp Ser 

170 175 180 

eca get gee tec ttc tge egg ggc ctg etc age cag ccg ggg ccc tea 728 
Pro Ala Ala Ser Phe Cys Arg Gly Leu Leu Ser Gin Pro Gly Pro Ser 

185 190 195 

tec ctg atg ccc gtg ctg ggc tak gat cet cct eag etc tat ctg acg 776 
Ser Leu Met Pro Val Leu Gly Xaa Asp Pro Pro Gin Leu Tyr Leu Thr 

200 205 210 

eag etc arg gag gcc ttt ggg gat ctg gcc ett ttc ttc tat gac eag 824 
Gin Leu Xaa Glu Ala Phe Gly Asp Leu Ala Leu Phe Phe Tyr Asp Gin 
215 220 225 230 

cat ggt gga gag gtg att ggt gtc etc tgg aag ccc ace age ttc cag 872 
His Gly Gly Glu Val He Gly Val Leu Trp Lys Pro Thr Ser Phe Gin 
235 240 245 






ccg cag ccc ttc aag gcc tec age aea aag ggg egc atg gtg atg tet 92 0 
Pro Gin Pro Phe Lys Ala Ser Ser Thr Lys Gly Arg Met Val Met Ser 

250 255 260 

cga ggt ggg gag eta gta atg gtg eee aat gtt gaa gea ate ctg gag 968 
Arg Gly Gly Glu Leu Val Met Val Pro Asn Val Glu Ala lie Leu Glu 

265 270 275 

gac ttt get gtg ctg ggt gaa ggc ctg gtg cag act gtg gag gee ega 1016 
Asp Phe Ala Val Leu Gly Glu Gly Leu Val Gin Thr Val Glu Ala Arg 

280 285 290 

agt gag agg tgg act gtg tgateccagc tctggagcaa gctgtagacg 1064 
Ser Glu Arg Trp Thr Val 
295 300 

gacagcagga cattggaect etagagcaag atgtcagtag gatgacetce accctcettg 1124 

gaeatgaate etccatggag ggcctgctgg ctgaaeatgc tgaatcatct ecaaeaaaac 1184 

ccagccccaa ctttctctct gatgetccag cattggggea ggggcatggt ggcceatgta 1244 

gtctcctggg cetcaccate ceagaagagg agtgggagcc agctcagaga aggaactgaa 1304 

cecaggagat ecatccacct attagcectg ggcctggacc tecctgcgat ttcccactee 1364 

tttcttagte ttcttecaga aacagagaag gggatgtgtg cetgggagag getctgtcte 1424 

ettcctgetg ccaggacctg tgcctagact tagcatgece ttcactgeag tgteaggcct 1484 

ttagatggga eccagegaaa atgtggecct tctgagtcac ateaccgaca ctgageagtg 1544 

gaaagggget atatgtgtat gaatagacca cattgaagga geaaaaaaaa aaamech 1601 



<210> 284
<211> 1206
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 69..263

<221> sig_peptide
<222> 69..125
<223> Von Heijne matrix
score 3.90000009536743
seq ALSMSSFSFHSSS/CS

<221> polyA_signal
<222> 1173..1178

<221> polyA_site
<222> 1196..1205



<400> 284 

acatttgtga ctttaceaat accetcccag ttettgatag acagctgtag gttgetgggt 60 
teaagaat atg ggt ggg ata tgg aat get ett tea atg tct age ttc agt 110 
Met Gly Gly lie Trp Asn Ala Leu Ser Met Ser Ser Phe Ser 
-15 -10 
ttt cat tea tec tee tgc tea gea ctg tea gee aag age tta etc age 158 
Phe His Ser Ser Ser Cys Ser Ala Leu Ser Ala Lys Ser Leu Leu Ser 
-5 15 10 

aga cac eac ata ctg cag cag ttc eta gtg aga aaa tct gtg eca eta 2 06 

Arg His His lie Leu Gin Gin Phe Leu Val Arg Lys Ser Val Pro Leu 

15 20 25 

gaa aat get tea ctt eca ttt cet cae ctg ggc agt tet ctg ttt aaa 254 






Glu Asn Ala Ser Leu Pro Phe Pro His Leu Gly Ser Ser Leu Phe Lys 

30 35 40 

att gtg ggc tgatttggtc ttcctctcct cctcccactg ttactgccct 303
Ile Val Gly
45

gcagcccttg ttcaggtgta cagaccctta ttctggcctc tagtgtcctt gtctgtcatg 363
acacaccctt ccgcccaaat acctctgacc ccaaggctgg aatggggctg gtaggarata 423
agtttgctta ctcatartca tgtcctttct cttggcacct gcttccctgc ggtgtcctca 483
aatggatttc tgtgtggcag tggartgatt gcatgaattt ttctgtaaca cattaacttt 543
gtattattat taagggartt tgaraaagct ttgcttataa tgtcaaggca aggaggtaaa 603
aactggagcc caaakaaatt cccttagggc aagattatgt tataataraa aattgaattt 663
cctgaggcag tggctgccac cccttttcar atgtttagtc ctgcaaatag catctttctt 723
gtagtctgtg acatggatgg ggatgctagg gcccttaggg gcaaggggac taaactaaat 783
caakttgagt ttttttccag caggggttar gggaggtact csctgttgat atttgacact 843
araaagtaat cttttttaca aaactgtttt tctaggtggg tggaaagtga aactgccaca 903
tccttgttgg tttagtccaa raratcattt gcaacaacag taratgtccg ggttttgttt 963
ctgtcttttt attatgaaaa actatgttaa gggggaaaat gtggattatg gtaaccarag 1023
gaatccctas ccttgttttc cttaraarac ttgtttagtg ttttatcara cgtctgttgt 1083
agttgtarac aggaaagctt gtgaraaaaa caccacatgg ascctgtaaa tgtttttgca 1143
caacctgtaa agcattcttg gaaktggcca gtaaaaaggg gttttaccat ttaaaaaaaa 1203
aat 1206



<210> 285
<211> 536
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 115..285

<221> sig_peptide
<222> 115..204
<223> Von Heijne matrix
score 3.70000004768372
seq SMMLLTVYGGYLC/SV

<221> polyA_signal
<222> 505..510

<221> polyA_site
<222> 525..536

<400> 285 

acgagtgctg cgttcggctg tgctgggaag ttgcgtagac agtggcctcg agaccctgcc 60 
tgcctgagga ggcctcggtt ggatgcgaag gagctgcagc atccagggga caag atg 117 

Met 
-30 

cca act ggc aag cag eta get gac att ggc tat aag acc ttc tct acc 165 
Pro Thr Gly Lys Gin Leu Ala Asp lie Gly Tyr Lys Thr Phe Ser Thr 

-25 -20 -15 

tec atg atg ctt etc act gtg tat ggg ggg tac etc tge agt gtc ega 213 
Ser Met Met Leu Leu Thr Val Tyr Gly Gly Tyr Leu Cys Ser Val Arg 

-10 -5 1 

gtc tae cac tat ttc cag tgg cgc agg gcc cag cge eag gee gca gaa 261 






Val Tyr His Tyr Phe Gin Trp Arg Arg Ala Gin Arg Gin Ala Ala Glu 

5 10 15 

gaa cag aag dac tea gga ate atg tagaactggg gggctttttc tcctgagcar 315 
Glu Gin Lys Xaa Ser Gly lie Met 
20 25 

asakgcccaa ggcatgctgt ggagagactt cacctgccac catttccagg tcaacaggac 375 
tagagcgttg atggttttca aaccctgttg gaagaaagtg cccatggttt ctctggttct 435 
gccartttga cagtttatgg argcttttga atcgtaatar caatgtgagg gtgargtaca 495 
cctacagaca ttaaataatt tgctgtgtca aaaaaaaaaa a 536 



<210> 286 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 90..344

<221> sig_peptide
<222> 90..140
<223> Von Heijne matrix
score 8.19999980926514
seq LLLITAILAVAVG/FP

<221> polyA_signal
<222> 500..505

<221> polyA_site
<222> 515..527

<400> 286

aatatrarac agctacaata ttccagggcc artcacttgc catttctcat aacagcgtca 60
gagagaaaga actgactgar acgtttgag atg aag aaa gtt ctc ctc ctg atc 113
Met Lys Lys Val Leu Leu Leu Ile

-15 -10

aea gee ate ttg gca gtg get gtw ggt tte eca gte tct eaa gac cag 161 
Thr Ala lie Leu Ala Val Ala Val Gly Phe Pro Val Ser Gin Asp Gin 

-5 15 
gaa cga gaa aaa aga agt ate agt gac age gat gaa tta get tea ggr 2 09 

Glu Arg Glu Lys Arg Ser lie Ser Asp Ser Asp Glu Leu Ala Ser Gly 

10 15 20 

wtt ttt gtg ttc ect tac cea tat cca ttt egc cca ett eca cea att 257 
Xaa Phe Val Phe Pro Tyr Pro Tyr Pro Phe Arg Pro Leu Pro Pro lie 

25 30 35 

cca ttt cca aga ttt cea tgg ttt aga egt aat ttt ect att cea ata 305 
Pro Phe Pro Arg Phe Pro Trp Phe Arg Arg Asn Phe Pro lie Pro lie 
40 45 50 55 

ect gaa tct gee ect aea act cec ett ect age gaa aag taaaeaaraa 3 54 

Pro Glu Ser Ala Pro Thr Thr Pro Leu Pro Ser Glu Lys 

60 65 
ggaaaagtca erataaacet ggteaeetga aattgaaatt gageeaettc ettgaaraat 414 
caaaattcet gttaataaaa raaaaacaaa tgtaattgaa atagcacaea gcattetcta 474 
gtcaatatct ttagtgatet tctttaataa acatgaaage aaaaaaaaaa aaace 529 






<210> 287 
<211> 493 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57..311

<221> sig_peptide
<222> 57..107
<223> Von Heijne matrix
score 8.19999980926514
seq LLLITAILAVAVG/FP

<221> polyA_signal
<222> 467..472

<221> polyA_site
<222> 482..493

<400> 287

:^4acttgccat ttctcataac agcgtcagag agaaagaact gactgaaacg tttgag atg 59 

Met 



aag aaa gtt ctc ctc ctg atc aca gcc atc ttg gca gtg gct gtt ggt 107
Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val Gly
-15 -10 -5

ttc cca gtc tct caa gac cak gaa cga gaa aaa aga agt atc agt gac 155
Phe Pro Val Ser Gln Asp Xaa Glu Arg Glu Lys Arg Ser Ile Ser Asp
5 10 15

agc gat gaa tta gct tca ggg ttt ttt gtg ttc cct tac cca tat cca 203
Ser Asp Glu Leu Ala Ser Gly Phe Phe Val Phe Pro Tyr Pro Tyr Pro
20 25 30

ttt cgc cca ctt cca cca att cca ttt cca aga ttt cca tgg ttt aga 251
Phe Arg Pro Leu Pro Pro Ile Pro Phe Pro Arg Phe Pro Trp Phe Arg
35 40 45

cgt aat ttt cct att cca ata cct gaa tct gcc cct aca act ccc ctt 299
Arg Asn Phe Pro Ile Pro Ile Pro Glu Ser Ala Pro Thr Thr Pro Leu
50 55 60

ccg agc gaa aag taaacaagaa ggaaaagtca cgataaacct ggtcacctga 351
Pro Ser Glu Lys
65

aattgaaatt gagccacttc cttgargaat caaaattcct gttaataaaa gaaaaacaaa 411
tgtaattgaa atagcacaca gcattctcta gtcaatatct ttagtgatct tctttaataa 471
acatgaaagc aaaaaaaaaa aa 493



<210> 288 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS
<222> 96..302

<221> sig_peptide
<222> 96..182
<223> Von Heijne matrix
score 5
seq ELSLLPSSLWVLA/TS

<221> polyA_site
<222> 501..514

<400> 288 

aagagacgtc accggctgcg cccttcagta tcgcggacgg aagatggcgt ccgccacccg 60 
tctcatccag cggctgcgga actgggcgtc cgggc atg acc tgc agg gga agc 113
Met Thr Cys Arg Gly Ser
-25

tgc agc tac gct acc agg aga tct cca agc gaa ctc agc ctc ctc cca 161
Cys Ser Tyr Ala Thr Arg Arg Ser Pro Ser Glu Leu Ser Leu Leu Pro
-20 -15 -10

agc tcc ctg tgg gtc cta gcc aca agc tct cca aca att act att gca 209
Ser Ser Leu Trp Val Leu Ala Thr Ser Ser Pro Thr Ile Thr Ile Ala
-5 1 5

gcg atg gcc gcc ggg aat ctg tgc ccc ctt cca tca tca tkt cgt 257
Leu Ala Met Ala Ala Gly Asn Leu Cys Pro Leu Pro Ser Ser Xaa Arg
10 15 20 25

crc aaa agg cgc tgg tgt cag gca asc car caa ara gct ctg ctg 302
Xaa Lys Arg Arg Trp Cys Gln Ala Xaa Gln Gln Xaa Ala Leu Leu
30 35 40

tagctgccac tgaaaaraag gcggtgactc cagctcctcc cataaagagg tgggagctgt 362

y'beteggacca geettacctg tgacaetgea cccteaegge eacccgacta etttgeetec 422 

^ii ttggatttcc tccagggaga atgtgaccta atttatgaca aataegtara geteaggtat 482 

tiicacttctagt tttaetttaa aaaataaaaa aatagagae 521 



<210> 289
<211> 811
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 161..526

<221> sig_peptide
<222> 161..328
<223> Von Heijne matrix
score 4.19999980926514
seq XSPLLTLALLGQC/SL

<221> polyA_site
<222> 799..811

<400> 289 

aaaaaattgc agtgctgaag acaetggace egeaaaaggc tgtcceteee aaaeetggga 60 

ttetgggetc actgagttea cetgegagtc agecctacct geactgctet ggtetagtac 120 

aaacaggctg etggcattga ggtctgctae aaaaanarta atg gtc cea tgg ccc 175 






Met Val Pro Trp Pro 
-55 

agg ggc aag gtg aaa act get cct att ccc ate tct agg ttt cct ttc 223 
Arg Gly Lys Val Lys Thr Ala Pro lie Pro lie Ser Arg Phe Pro Phe 

-50 -45 -40 

etc cct acc cac gac cca ccc acc cca gca cat tgg tct cca gca tct 271 
Leu Pro Thr His Asp Pro Pro Thr Pro Ala His Trp Ser Pro Ala Ser 
-35 -30 -25 -20 

cat cag cag ttt aaa cat kkg tea ccc etc etc act ttg gee ctg ctg 319 
His Gin Gin Phe Lys His Xaa Ser Pro Leu Leu Thr Leu Ala Leu Leu 

-15 -10 -5 

ggt cag tgc tct ctg ttc arc aat ttg agg aaa aaa ett gca ggg caa 367 
Gly Gin Cys Ser Leu Phe Xaa Asn Leu Arg Lys Lys Leu Ala Gly Gin 

15 10 
aaa gca aaa aaa tta cct tee tte tec age ctg ccc ctg aca etc tgg 415 
Lys Ala Lys Lys Leu Pro Ser Phe Ser Ser Leu Pro Leu Thr Leu Trp 

15 20 25 

cca tta act eet caa ttt get gag etc act aca gtg gca caa aaa aaa 463 
Pro Leu Thr Pro Gin Phe Ala Glu Leu Thr Thr Val Ala Gin Lys Lys 
30 35 40 45 

ttg agg tgg tcc ggg acc cta ggt tgg ggt cca gtt ccc agc tgg gtt 511
Leu Arg Trp Ser Gly Thr Leu Gly Trp Gly Pro Val Pro Ser Trp Val

50 55 60

caa ttt ttt tta ggg tgaatggagg garagttggg gactgaaaas ccttcaaara 566
Gln Phe Phe Leu Gly

65

'^baatgttatt acageaktct ceccttatcc aaaktttcct tttcetgadt ttcagttage 626 

I'tatggteaae cgcttggaaa atakttgaae acagtaeaat aaratatttt gaggctggga 686 

i,ktggtggcte atgectgtaa taatcccagg actttgtgar accaaktttg aaggatcact 746 

J'tgaaeecagg aktttgarac cascetgggc aaeatrgtra gaectcatet ctacaaaaaa 8 06 

I aaaaa 811 



<210> 290
<211> 625
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 210..332

<221> sig_peptide
<222> 210..299
<223> Von Heijne matrix
score 8.10000038146973
seq ITCLLAFWVPASC/IQ

<221> polyA_signal
<222> 594..599

<221> polyA_site
<222> 613..625

<400> 290 

acaggtcsmc ttaacatcte ttgatttgag ceactcceac tgtcatcagc ttteacctgg 60 
attatcgtga cagcctccta ctgcttctct atcatgtggc cagagctatc ttccctaaaa 120
atgcattgca tagttgatca agtcactctc tggcctaaaa ccttccttgg ctccctgctg 180 
ccctcaggat aaagtctgga cccctcagc atg get tgt gag act cat ggt gtc 233 

Met Ala Cys Glu Thr His Gly Val 
-30 -25 
ctt gtc cct get cac etc tct ggt etc ate act tgc ctt ctt gca ttc 281 
Leu Val Pro Ala His Leu Ser Gly Leu lie Thr Cys Leu Leu Ala Phe 

-20 -15 -10 

tgg gtc cca gcc tec tgt ate eag aga tgc agt ggc tct cca ttg cea 329 
Trp Val Pro Ala Ser Cys lie Gin Arg Cys Ser Gly Ser Pro Leu Pro 

-5 15 10 

etc tgattcctcc tttcttttgg tcacagagaa agggtacttt ctetgtcaaa 382 
Leu 

tctcaactta gacttgactt cetccaagga getttggcta tactctctec cwcgaceccc 442 
accctggcat actaeaeara tcactctggg ctcacttgee tgeetaatgg teatctccce 502 
agtaaaetgt aagcteettg agggcaagga ttgtgttgga atttttgtat taacagtgce 562 
tggcttggtg cetggcacet aaaaagcact caataaatgt ttgtttaatg aaaaaaaaaa 622 
aaa 625 



<210> 291
<211> 684
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 212..361

<221> sig_peptide
<222> 212..319
<223> Von Heijne matrix
score 4.09999990463257
seq HWLFLASLSGIKT/YQ

<221> polyA_signal
<222> 650..655

<221> polyA_site
<222> 673..684

<400> 291 

atccccawns cactctctca cagagactgt tcttttcctt ctgagaccct actccagctt 60 
gtagttctaa atctgtgatt atgcactgtc tgtettccte ttgaggteag gggceattte 120 
ttttgttctc tgctatgctc aggaeccaga tcaaaggage tcagtaacta tttacaggcg 180 
tacatcatat gtggaggaca ettatgctgt g atg gee cca cac aca get tec 232 

Met Ala Pro His Thr Ala Ser 
-35 -30 
ttt ggg gtc tgt ccc ctg etc tec gtt ace egc gtg gta gcc act gag 280 
Phe Gly Val Cys Pro Leu Leu Ser Val Thr Arg Val Val Ala Thr Glu 

-25 -20 -15 

cac tgg etc ttc ctg get tea etc tct ggc ate aaa act tat eag tec 328 
His Trp Leu Phe Leu Ala Ser Leu Ser Gly lie Lys Thr Tyr Gin Ser 

-10 -5 1 

tac atc tca gtc ttt tgc aag gtg aca ctt atc tgattaccta attcacacra 381
Tyr Ile Ser Val Phe Cys Lys Val Thr Leu Ile
5 10

aggtgttaat ggtggtaatg gcataktatt tattacccca ggggacccak aacggtggta 441 

tcaaaacata tcattcccca gtggtttaaa actctggtag ctttccargg aatccaaagt 501 

ggaatccagt ctccttagct gawttcacag ggccccgtct gcacaacttg gcttctgtcg 561 

gcttccctan ccctgacttc ccaagcctta gtcatcaccc tctctcccac ccagggctca 621 

gcacagtacc tggaacagtc aagccctcaa taaatgttta ctgagtgcat yaaaaaaaaa 681 

aaa 684 



<210> 292 
<211> 628 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75..482

<221> sig_peptide
<222> 75..128
<223> Von Heijne matrix
score 3.59999990463257
seq KMLISVAMLGAXA/GV

<221> polyA_signal
<222> 595..600

<221> polyA_site
<222> 618..627

<400> 292

njp^gtgagacc gcgcggcaac agcttgcggc tgcggggagc tcccgtgggc gctccgctgg 60 
ctgtgcaggc ggcc atg gat tcc ttg cgg aaa atg ctg atc tca gtc gca 110
Met Asp Ser Leu Arg Lys Met Leu Ile Ser Val Ala

atg ctg ggc gca rgg gct ggc gtg ggc tac gcg ctc ctc gtt atc gtg 158
Met Leu Gly Ala Xaa Ala Gly Val Gly Tyr Ala Leu Leu Val Ile Val
-5 1 5 10

acc ccg gga gag cgg cgg aag cag gaa atg cta aag gag atg cca ctg 206
Thr Pro Gly Glu Arg Arg Lys Gln Glu Met Leu Lys Glu Met Pro Leu
15 20 25

cag gac cca agg agc agg gag gag gcg gcc agg acc cag cag cta ttg 254
Gln Asp Pro Arg Ser Arg Glu Glu Ala Ala Arg Thr Gln Gln Leu Leu
30 35 40

ctg gcc act ctg cag gag gca gcg acc acg cag gag aac gtg gcc tgg 302
Leu Ala Thr Leu Gln Glu Ala Ala Thr Thr Gln Glu Asn Val Ala Trp
45 50 55

agg aag aac tgg atg gtt ggc ggc gaa ggc ggc gcc acg gga kgt cac 350
Arg Lys Asn Trp Met Val Gly Gly Glu Gly Gly Ala Thr Gly Xaa His
60 65 70

cgt gag acc gga ctt gcc tcc gtg ggc gcc gga cct tgg ctt ggg cgc 398
Arg Glu Thr Gly Leu Ala Ser Val Gly Ala Gly Pro Trp Leu Gly Arg
75 80 85 90

agg aat ccg agg cag ctt tct cct tcg tgg gcc can cgg aaa atc cgg 446
Arg Asn Pro Arg Gln Leu Ser Pro Ser Trp Ala Xaa Arg Lys Ile Arg
95 100 105

amc gaa aat wcc atg cca gga ctc tcc ggg gtc ctg tgaactgccg 492
Xaa Glu Asn Xaa Met Pro Gly Leu Ser Gly Val Leu

110 115 

tcgggtgagc acgtgtcccc caaaccctgg actgactgct ttaaggtccg caaggcgggc 552 

cagggccgag acgcgagtcg gatgtggtga actgaaagaa ccaataaaat catgttcctc 612 

cammcaaaaa aaaaah 628 



<210> 293 
<211> 813 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 50..631

<221> sig_peptide
<222> 50..244
<223> Von Heijne matrix
score 8
seq LTLIGCLVTGVES/KI

<221> polyA_signal
<222> 777..782

<221> polyA_site
<222> 801..812

<400> 293

ttactcgagc cttgttagaa tcagacatgg cttcagggg atg cag gac 58
Met Gln Asp
-65

gct ccc ctg agc tgc ctg tca ccg act aag tgg agc agt gtt tct tcc 106
Ala Pro Leu Ser Cys Leu Ser Pro Thr Lys Trp Ser Ser Val Ser Ser
-60 -55 -50

gca gac tca act gag aag tca gcc tct gcg gca ggc acc agg aat ctg 154
Ala Asp Ser Thr Glu Lys Ser Ala Ser Ala Ala Gly Thr Arg Asn Leu
-45 -40 -35

cct ttt cag ttc tgt ctc cgg cag gct ttg agg atg aag gct gcg ggc 202
Pro Phe Gln Phe Cys Leu Arg Gln Ala Leu Arg Met Lys Ala Ala Gly
-30 -25 -20 -15

att ctg acc ctc att ggc tgc ctg gtc aca ggc gtc gag tcc aaa atc 250
Ile Leu Thr Leu Ile Gly Cys Leu Val Thr Gly Val Glu Ser Lys Ile
-10 -5 1

tac act cgt tgc aaa ctg gca aaa ata ttc tcg agg gct ggc ctg gac 298
Tyr Thr Arg Cys Lys Leu Ala Lys Ile Phe Ser Arg Ala Gly Leu Asp
5 10 15

aat cyg agg ggc ttc agc ctt gga aac tgg atc tgc atg gcg tat tat 346
Asn Xaa Arg Gly Phe Ser Leu Gly Asn Trp Ile Cys Met Ala Tyr Tyr
20 25 30

gag agc ggc tac aac acc aca gcc car acg gtc ctg gat gac ggc agc 394
Glu Ser Gly Tyr Asn Thr Thr Ala Gln Thr Val Leu Asp Asp Gly Ser
35 40 45 50

atc gac tay ggc atc ttc caa atc aac agc ttc gcg tgg tgc aga cgc 442
Ile Asp Tyr Gly Ile Phe Gln Ile Asn Ser Phe Ala Trp Cys Arg Arg
55 60 65

gga aag ctg aag gag aac aac cac tgc cay gtc gcc tgc tca gcc ttg 490
Gly Lys Leu Lys Glu Asn Asn His Cys His Val Ala Cys Ser Ala Leu
70 75 80

rtc act gat gac ctc aca gat gca att atc tgt gcc arg aaa att gtt 538
Xaa Thr Asp Asp Leu Thr Asp Ala Ile Ile Cys Ala Xaa Lys Ile Val
85 90 95

aaa gag aca caa gga atg aac tat tgg caa ggc tgg aag aaa cay tgt 586
Lys Glu Thr Gln Gly Met Asn Tyr Trp Gln Gly Trp Lys Lys His Cys
100 105 110

gag ggg aga gac ctg tcc gas tgg aaa aaa ggc tgt gag gtt tcc 631
Glu Gly Arg Asp Leu Ser Xaa Trp Lys Lys Gly Cys Glu Val Ser
115 120 125

taaactggaa ctggacccag gatgctttgc ascaacgccc tagggtttgc agtgaatgtc 691

caaatgcctg tgtcatcttg tcccgtttcc tcccaatatt ccttctcaaa cttggagagg 751

gaaaattaag ctatactttt aagaaaataa atatttccat ttaaatgtca amaaaaaaaa 811

ah 813



<210> 294
<211> 778
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 154..576

<221> sig_peptide
<222> 154..360
<223> Von Heijne matrix
score 4.80000019073486
seq MMVLSLGIILASA/SF

<221> polyA_signal
<222> 737..742

<221> polyA_site
<222> 763..775

<400> 294

agtaaaaaaa cactggaata aggaagggct gatgactttc agaagatgaa ggtaagtaga 60
aaccgttgat gggactgaga aaccagagtk aaaacctctt tggagcttct gaggactcag 120
ctggaaccaa cgggcacagt tggcaacacc atc atg aca tca caa cct gtt ccc 174
Met Thr Ser Gln Pro Val Pro
-65

aat gag acc atc ata gtg ctc cca tca aat gtc atc aac ttc tcc caa 222
Asn Glu Thr Ile Ile Val Leu Pro Ser Asn Val Ile Asn Phe Ser Gln
-60 -55 -50

gca gag aaa ccc gaa ccc acc aac cag ggg cag gat agc ctg aag aaa 270
Ala Glu Lys Pro Glu Pro Thr Asn Gln Gly Gln Asp Ser Leu Lys Lys
-45 -40 -35

cat cta cac gca gaa atc aaa gtt att ggg act atc cag atc ttg tgt 318
His Leu His Ala Glu Ile Lys Val Ile Gly Thr Ile Gln Ile Leu Cys
-30 -25 -20 -15

ggc atg atg gta ttg agc ttg ggg atc att ttg gca tct gct tcc ttc 366
Gly Met Met Val Leu Ser Leu Gly Ile Ile Leu Ala Ser Ala Ser Phe
-10 -5 1

tct cca aat ttt acc caa gtg act tct aca ctg ttg aac tct gct tac 414
Ser Pro Asn Phe Thr Gln Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr
5 10 15

cca ttc ata gga ccc ttt ttt gtr akt aaa btt tct gag gag ggc agg 462
Pro Phe Ile Gly Pro Phe Phe Val Xaa Lys Xaa Ser Glu Glu Gly Arg
20 25 30






S99 


caa 


ara 




gag 


gaa 


rat 


vcc 


aat 


age 


tta 


aac 


ttc 


cca 


set 


510 


Met 


Gly 


Gin 


Xaa 


Gly 


Glu 


Glu 


Xaa 


Xaa 


Asn 


Ser 


Leu 


Asn 


Phe 


Pro 


Xaa 




35 










40 










45 










50 




gcc 


age 


ttg 


eta 


tkt 


ttg 


ate 


tgc 


cag 


gav 


caa 


gga 


ttc 


aac 


ggt 


gaa 


558 


Ala 


Ser 


Leu 


Leu 


Xaa 
55 


Leu 


He 


Cys 


Gin 


Xaa 
60 


Gin 


Gly 


Phe 


Asn 


Gly 
65 


Glu 




tct 


tgt 


tct 


cct 


gtc 


ggg 


targataaca ggggttgctt rattttagat 




606 


Ser 


Cys 


Ser 


Pro 
70 


Val 


Gly 

























caatttctta tcagactcaa ataaacattt cttttgaaaa tcatcttatt cttcacatta 666
tcatcttgag ctatgatgga aactagtgas ktctctccag gtttaggcga aaaaaaaatc 726
catgaattag gataaagttg ggaaggaaca ttttatacaa aaaaaaaaah cc 778 



<210> 295
<211> 1060
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 154..897

<221> sig_peptide
<222> 154..360
<223> Von Heijne matrix
score 4.80000019073486
seq MMVLSLGIILASA/SF

<221> polyA_signal
<222> 1017..1022

<221> polyA_site
<222> 1044..1054

<400> 295

agtaaaaaaa cactggaata aggaagggct gatgactttc agaagatgaa ggtaagtaga 60
aaccgttgat gggactgaga aaccagagtk aaaacctctt tggagcttct gaggactcag 120
ctggaaccaa cgggcacagt tggcaacacc atc atg aca tca caa cct gtt ccc 174
Met Thr Ser Gln Pro Val Pro
-65

aat gag acc atc ata gtg ctc cca tca aat gtc atc aac ttc tcc caa 222
Asn Glu Thr Ile Ile Val Leu Pro Ser Asn Val Ile Asn Phe Ser Gln
-60 -55 -50

gca gag aaa ccc gaa ccc acc aac cag ggg cag gat agc ctg aag aaa 270
Ala Glu Lys Pro Glu Pro Thr Asn Gln Gly Gln Asp Ser Leu Lys Lys
-45 -40 -35

cat cta cac gca gar rtc aaa gtt att ggg act atc cag atc ttg tgt 318
His Leu His Ala Glu Xaa Lys Val Ile Gly Thr Ile Gln Ile Leu Cys
-30 -25 -20 -15

ggc atg atg gta ttg agc ttg ggg atc att ttg gca tct gct tcc ttc 366
Gly Met Met Val Leu Ser Leu Gly Ile Ile Leu Ala Ser Ala Ser Phe
-10 -5 1

tct cca aat ttt acc caa gtg act tct aca ctg ttg aac tct gct tac 414
Ser Pro Asn Phe Thr Gln Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr
5 10 15

cca ttc ata gga ccc ttt ttt ttt atc atc tct ggc tct cta tca atc 462
Pro Phe Ile Gly Pro Phe Phe Phe Ile Ile Ser Gly Ser Leu Ser Ile
20 25 30

gcc aca aaa aaa agg tta acc aac ctt ttg gtg cat acc acc ctg gtt 510
Ala Thr Lys Lys Arg Leu Thr Asn Leu Leu Val His Thr Thr Leu Val
35 40 45 50

gga agc att ctg agt gct ctg tct gcc ctg gtg ggt ttc att ayc ctg 558
Gly Ser Ile Leu Ser Ala Leu Ser Ala Leu Val Gly Phe Ile Xaa Leu
55 60 65

tct gtc aaa cag gcc acc tta aat cct gcc tca ctg cak tgt gag ttg 606
Ser Val Lys Gln Ala Thr Leu Asn Pro Ala Ser Leu Xaa Cys Glu Leu
70 75 80

gmc aaa aat aat ata cca aca ara akt tat gtt yct tac ttt tat cat 654
Xaa Lys Asn Asn Ile Pro Thr Xaa Xaa Tyr Val Xaa Tyr Phe Tyr His
85 90 95

gat tca ctt tat acc acg gac kgc tat aca gcc aaa gcc akt ctg gct 702
Asp Ser Leu Tyr Thr Thr Asp Xaa Tyr Thr Ala Lys Ala Xaa Leu Ala
100 105 110

gga act ctc tct ctg atg ctg att tgc act ctg ctg gaa ttc tgc cwa 750
Gly Thr Leu Ser Leu Met Leu Ile Cys Thr Leu Leu Glu Phe Cys Xaa
115 120 125 130

sct gtg ctc act gct gtg ctg cgg tgg aaa cag gct tac tct gac ttc 798
Xaa Val Leu Thr Ala Val Leu Arg Trp Lys Gln Ala Tyr Ser Asp Phe
135 140 145








ggg 


agt 


gta 


ctt 


ttc 


ctg 


cet 


cam 


agt 


tac 


att 


ggw 


aat 


tct 


ggm 


846 




Gly 


Ser 


Val 
150 


Leu 


Phe 


Leu 


Pro 


Xaa 
155 


Ser 


Tyr 


He 


Gly 


Asn 
160 


Ser 


Gly 




';ktg 


tee 


tea 


aaa 


atg 


acy 


cat 


gac 


tgt 


gga 


tat 


gaa 


gaa 


eta 


ttg 


act 


894 


^ii^jket 


Ser 


Ser 
165 


Lys 


Met 


Thr 


His 


Asp 
170 


Cys 


Gly 


Tyr 


Glu 


Glu 
175 


Leu 


Leu 


Thr 





tct taagaaaaaa gggagaaata ttaatcagaa agttgattct tatgataata 947
Ser

tggaaaagtt aaccattata gaaaagcaaa gcttgagttt cctaaatgta agcttttaaa 1007

gtaatgaaca ttaaaaaaaa ccattatttc actgtcaaaa aaaaaaamcc nkt 1060



<210> 296 
<211> 444 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 146..292

<221> sig_peptide
<222> 146..253
<223> Von Heijne matrix
score 5.5
seq FTSMCILFHCLLS/FQ

<221> polyA_signal
<222> 395..400

<221> polyA_site
<222> 433..444

<400> 296 

aacttgggac aagaratcaa actttaaaga tggtctaaag cccctcttaa aggtctgact 60 
gtgtcggacc tctagagcta atctcactag atgtgagcca ttgtttatat tctagccatc 120 
ctttcatttc attctagaag acccc atg caa gtt ccc cac cta agg gtc tgg 172
Met Gln Val Pro His Leu Arg Val Trp
-35 -30

aca cag gtg awa gat acc ttc att ggt tat aga aat ttg gga ttt aca 220
Thr Gln Val Xaa Asp Thr Phe Ile Gly Tyr Arg Asn Leu Gly Phe Thr
-25 -20 -15

agt atg tgc ata ttg ttc cac tgt ctt ctt agc ttt cag gtt ttc aaa 268
Ser Met Cys Ile Leu Phe His Cys Leu Leu Ser Phe Gln Val Phe Lys
-10 -5 1 5

aag aaa aga aaa ctt ara ctt ttc tgatgttctt ttttacgtaa ataaccattt 322
Lys Lys Arg Lys Leu Xaa Leu Phe

tattgttgtt ttgctttttc tgccttcaaa ctactcccac aggccaaata tavctggctg 382
[^;!pttctttctg taaataaagt tttattgggc cacagccatg gccatctttt aaaaaaaaaa 442 
aa 444



<210> 297
<211> 754
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 126..383

<221> sig_peptide
<222> 126..167
<223> Von Heijne matrix
score 7.5
seq VALNLILVPCCAA/WC

<221> polyA_signal
<222> 726..731

<221> polyA_site
<222> 743..754

<400> 297

aattgtatgt tacgatgttg tattgatttt taagaaagta attkratttg taaaacttct 60 
gctcgtttac actgcacatt gaatacaggt aactaattgg wwggagaggg gaggtcactc 120 
ttttg atg gtg gcc ctg aac ctc att ctg gtt ccc tgc tgc gct gct tgg 170
Met Val Ala Leu Asn Leu Ile Leu Val Pro Cys Cys Ala Ala Trp
-10 -5 1

tgc gac cca cgg agg atc cac tcc cag gat gac gtg ctc cgt agc tct 218
Cys Asp Pro Arg Arg Ile His Ser Gln Asp Asp Val Leu Arg Ser Ser
5 10 15

gct gct gat act ggg tct gcg atg cag cgg cgt gag gcc tgg gct ggt 266
Ala Ala Asp Thr Gly Ser Ala Met Gln Arg Arg Glu Ala Trp Ala Gly
20 25 30

tgg aga agg tca caa ccc ttc tct gtt ggt ctg cct tct gct gaa aga 314
Trp Arg Arg Ser Gln Pro Phe Ser Val Gly Leu Pro Ser Ala Glu Arg
35 40 45

ctc gag aac caa cca ggg aag ctg tcc tgg agg tcc ctg gtc gga gag 362
Leu Glu Asn Gln Pro Gly Lys Leu Ser Trp Arg Ser Leu Val Gly Glu
50 55 60 65

gga cat aga atc tgt gac ctc tgacrrctgt gaasccaccc tgggctacar 413
Gly His Arg Ile Cys Asp Leu
70

aaaccacagt cttcccagca attattacaa ttcttgaatt ccttggggat tttttactgc 473

cctttcaaag cacttaaktg tkrratctaa cgtkttccag tgtctgtctg aggtgactta 533

aaaaatcaga acaaaacttc tattatccag agtcatggga gagtacaccc tttccaggaa 593

taatgttttg ggaaacactg aaatgaaatc ttcccagtat tataaattgt gtatttaaaa 653

aaaagaaact tttctgaatg cctacctggc ggtgtatacc aggcagtgtg ccagtttaaa 713

aagatgaaaa agaataaaaa cttttgagga aaaaaaaaaa a 754



<210> 298
<211> 629
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 66..497

<221> sig_peptide
<222> 66..239
<223> Von Heijne matrix
score 5.40000009536743
seq QLLDSVLWLGALG/LT

<221> polyA_signal
<222> 594..599

<221> polyA_site
<222> 618..629

<400> 298

aactcccaga atgctgacca aagtgggagg agcactaggt cttcccgtca cctccacctc 60
tctcc atg acc cgg ctc tgc tta ccc aga ccc gaa gca cgt gag gat ccg 110
Met Thr Arg Leu Cys Leu Pro Arg Pro Glu Ala Arg Glu Asp Pro
-55 -50 -45

atc cca gtt cct cca agg ggc ctg ggt gct ggg gag ggg tca ggt agt 158
Ile Pro Val Pro Pro Arg Gly Leu Gly Ala Gly Glu Gly Ser Gly Ser
-40 -35 -30

cca gtg cgt cca cct gta tcc acc tgg ggc cct agc tgg gcc cag ctc 206
Pro Val Arg Pro Pro Val Ser Thr Trp Gly Pro Ser Trp Ala Gln Leu
-25 -20 -15

ctg gac agt gtc cta tgg ctg ggg gca cta gga ctg aca atc cag gca 254
Leu Asp Ser Val Leu Trp Leu Gly Ala Leu Gly Leu Thr Ile Gln Ala
-10 -5 1 5

gtc ttt tcc acc act ggc cca gcc ctg ctg ctg ctt ctg gtc agc ttc 302
Val Phe Ser Thr Thr Gly Pro Ala Leu Leu Leu Leu Leu Val Ser Phe
10 15 20

ctc acc ttt gac ctg ctc cat agg ccc gca gtc aca ctc tgc cac agc 350
Leu Thr Phe Asp Leu Leu His Arg Pro Ala Val Thr Leu Cys His Ser
25 30 35

gca aac ttc tca cca ggg gcc aga gtc agg ggg ccg gtg aag gtc ctg 398
Ala Asn Phe Ser Pro Gly Ala Arg Val Arg Gly Pro Val Lys Val Leu
40 45 50

gac agc agg agg ctc tac tcc tgc aaa tgg gta cag tct cag gac aac 446
Asp Ser Arg Arg Leu Tyr Ser Cys Lys Trp Val Gln Ser Gln Asp Asn
55 60 65

tta gcc tcc agg aag cac tgc tgc tgc tgc tca tgg ggc tgg gcc cgc 494
Leu Ala Ser Arg Lys His Cys Cys Cys Cys Ser Trp Gly Trp Ala Arg
70 75 80 85

tcc tgaaaacctg tggcatgccc ttgwaccctg cttggcctgg ctttctgcct 547
Ser

ccatccttgg gcctgakanc ccctccccac aactcagtgt ccttcaaata tacaatgacc 607
acccttcttc aaaaaaaaaa aa 629



<210> 299
<211> 765
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 49..411

<221> sig_peptide
<222> 49..96
<223> Von Heijne matrix
score 10.1000003814697
seq LVLTLCTLPLAVA/SA

<221> polyA_signal
<222> 732..737

<221> polyA_site
<222> 750..763

<400> 299

aaagatccct gcagcccggc aggagagaag gctgagcctt ctggcgtc atg gag agg 57
Met Glu Arg
-15

ctc gtc cta acc ctg tgc acc ctc ccg ctg gct gtg gcg tct gct ggc 105
Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala Ser Ala Gly
-10 -5 1

tgc gcc acg acg cca gct cgc aac ctg agc tgc tac cag tgc ttc aag 153
Cys Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tyr Gln Cys Phe Lys
5 10 15

gtc agc agc tgg acg gag tgc ccg ccc acc tgg tgc agc ccg ctg gac 201
Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser Pro Leu Asp
20 25 30 35

caa gtc tgc atc tcc aac gag gtg gtc gtc tct ttt agt gag tcy ccc 249
Gln Val Cys Ile Ser Asn Glu Val Val Val Ser Phe Ser Glu Ser Pro
40 45 50

ccg ggc aga ggg cas gtg cca bgt gcc ggg gaa kgg ccg gtg ccc ccg 297
Pro Gly Arg Gly Xaa Val Pro Xaa Ala Gly Glu Xaa Pro Val Pro Pro
55 60 65

cct ctc wkc gac tta bct atg act cct cgg ckc ycc agg gcc tgg ggc 345
Pro Leu Xaa Asp Leu Xaa Met Thr Pro Arg Xaa Xaa Arg Ala Trp Gly
70 75 80

cck gtg ggt ccd aaa gtg cct cct gct gtc tct ccc gcg ctg ggc tcg 393
Pro Val Gly Pro Lys Val Pro Pro Ala Val Ser Pro Ala Leu Gly Ser
85 90 95

ggc gag cat ccs rva btg tgaatkkkga cttttttctc ckccatttga 441
Gly Glu His Pro Xaa Xaa
100 105

agtgtcacta ggaactgtca gcaggacaaa ggctctgatg tcactgaatt tacaaaraca 501

gcaggaacrs ackggtgggg atgggcagct gttcrargcr atgggtkatc tgcccttcct 561

ggcacagcac artacacctg ccatacaacc carcatcagg cakgctgcac tggaatcgat 621

acagtgtatg acaatgtcat atagtataac acaacataat gaatataacg tgtatattgc 681

aacttaatat aatacgatgt aatataatgc tacataatac aacataatat aataaaatag 741

aatgcaacac aaaaaaaaaa aacc 765



<210> 300
<211> 623
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 49..534

<221> sig_peptide
<222> 49..96
<223> Von Heijne matrix
score 10.1000003814697
seq LVLTLCTLPLAVA/SA

<221> polyA_signal
<222> 593..598

<221> polyA_site
<222> 612..623

<400> 300

aaagatccct gcagcccggc aggagagaag gctgagcctt ctggcgtc atg gag agg 57
Met Glu Arg
-15

ctc gtc cta acc ctg tgc acc ctc ccg ctg gct gtg gcg tct gct ggc 105
Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala Ser Ala Gly
-10 -5 1

tgc gcc acg acg cca gct cgc aac ctg agc tgc tac cag tgc ttc aag 153
Cys Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tyr Gln Cys Phe Lys
5 10 15

gtc agc agc tgg acg gag tgc ccg ccc acc tgg tgc agc ccg ctg gac 201
Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser Pro Leu Asp
20 25 30 35

caa gtc tgc atc tcc aac gag gtg gtc gtc tct ttt aaa tgg agt gta 249
Gln Val Cys Ile Ser Asn Glu Val Val Val Ser Phe Lys Trp Ser Val
40 45 50

cgc gtc ctg ctc agc aaa cgc tgt gct ccc aga tgt ccc aac gac aac 297
Arg Val Leu Leu Ser Lys Arg Cys Ala Pro Arg Cys Pro Asn Asp Asn
55 60 65




atg 


aaic 


ttc 


gaa 


tgg 


teg 


eeg 


gee 


ccc 


atg 


gtg 


caa 


ggc 


gtg 


ate 


aec 


345 


Met 


Xaa 


Phe 
70 


Glu 


Trp 


Ser 


Pro 


Ala 
75 


Pro 


Met 


Val 


Gin 


Gly 
80 


Val 


He 


Thr 






cgc 


tgc 


tgt 


tec 


tgg 


get 


etc 


tgc 


aac 


agg 


gca 


ctg 


ace 


cca 


cag 


393 


Arg 


Arg 

o tz 


Cys 


Cys 


Ser 


Trp 


Ala 
90 


Leu 


Cys 


Asn 


Arg 


Ala 
95 


Leu 


Thr 


Pro 


Gin 






999 


cgc 


tgg 


gee 


ctg 


era 


ggg 


ggg 


etc 


ctg 


etc 


cag 


gac 


cet 


teg 


441 


GlU 


Gly 


Arg 


Trp 


Ala 


Leu 


Xaa 


Gly 


Gly 


Leu 


Leu 


Leu 


Gin 


Asp 


Pro 


Ser 




100 










105 










110 










115 




agg 


ggc 


ara 


aaa 


aec 


tgg 


gtg 


egg 


cea 


cag 


ctg ggg 


etc 


cca 


etc 


tgc 


489 


Arg 


Gly 


Xaa 


Lys 


Thr 


Trp 


Val 


Arg 


Pro 


Gin 


Leu Gly 


Leu 


Pro 


Leu 


Cys 












120 










125 










130 






ctt 


ccc 


awt 


tec 


aac 


ccc 


etc 


tgc 


cca 


rgg 


gaa 


ace 


cag 


gaa 


gga 




534 




Pro 


Xaa 


Ser 


Asn 


Pro 


Leu 


Cys 


Pro 


Xaa 


Glu 


Thr 


Gin 


Glu 


Gly 












135 










140 










145 







taacactgtg ggtgccccca cctgtgcatt gggaccacra cttcaccctc ttggaracaa 594
taaactctca tgcccccaaa aaaaaaaaa 623



<210> 301
<211> 571
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 86..415

<221> sig_peptide
<222> 86..145
<223> Von Heijne matrix
score 9.80000019073486
seq FTIGLTLLLGXQA/MP

<221> polyA_signal
<222> 540..545

<221> polyA_site
<222> 560..571

<400> 301

aaaaactcac ccagtgagtg tgagcattta agaagcatcc tctgccaaga ccaaaaggaa 60
agaagaaaaa bggccaaaag ccaaa atg ara ctg atg gta ctt gtt ttc acc 112
Met Xaa Leu Met Val Leu Val Phe Thr
-20 -15

att ggg cta act ttg ctg cta gga rtt caa gcc atg cct gca aat cgc 160
Ile Gly Leu Thr Leu Leu Leu Gly Xaa Gln Ala Met Pro Ala Asn Arg
-10 -5 1 5

ctc tct tgc tac aga aag ata cta aaa gat cac aac tgt cac aac ctt 208
Leu Ser Cys Tyr Arg Lys Ile Leu Lys Asp His Asn Cys His Asn Leu
10 15 20

ccg gaa gga gta gct gac ctg aca cag att gat gtc aat gtc cag gat 256
Pro Glu Gly Val Ala Asp Leu Thr Gln Ile Asp Val Asn Val Gln Asp
25 30 35

cat ttc tgg gat ggg aag gga tgt gag atg atc tgt tac tgc aac ttc 304
His Phe Trp Asp Gly Lys Gly Cys Glu Met Ile Cys Tyr Cys Asn Phe
40 45 50

aag cga att gct ctg ctg ccc aaa aga cgt ttt ctt tgg acc aaa gat 352
Lys Arg Ile Ala Leu Leu Pro Lys Arg Arg Phe Leu Trp Thr Lys Asp
55 60 65

ctc ttt cgt gat tcc ttg caa caa tca atg aga atc ttc atg tat tct 400
Leu Phe Arg Asp Ser Leu Gln Gln Ser Met Arg Ile Phe Met Tyr Ser
70 75 80 85

ggc gaa cac cat tcc tgatttccca caaactgcac tacatcagta taactgcatt 455
Gly Glu His His Ser
90

tctagtttct atatagtgca atagagcata gattctataa attcttactt gtctaagaaa 515
gtaaatctgt gttaaacaag tagtaataaa agttaattca atccaaaaaa aaaaaa 571



<210> 302
<211> 612
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 56..268

<221> sig_peptide
<222> 56..100
<223> Von Heijne matrix
score 4.59999990463257
seq LLTHNLLSSHVRG/VG

<221> polyA_signal
<222> 584..589

<221> polyA_site
<222> 601..612

<400> 302
ctaatcgaaa agggggattt tccggttccg gcctggcgag agtttgtgcg gcgac atg 58
Met
-15

aaa ctg ctt acc cac aat ctg ctg agc tcg cat gtg cgg ggg gtg ggg 106
Lys Leu Leu Thr His Asn Leu Leu Ser Ser His Val Arg Gly Val Gly
-10 -5 1

tcc cgt ggc ttc ccc ctg cgc ctc cag gcc acc gag gtc cgt atc tgc 154
Ser Arg Gly Phe Pro Leu Arg Leu Gln Ala Thr Glu Val Arg Ile Cys
5 10 15

cct gtg gaa ttc aac ccc aac ttc gtg gcg cgt atg ata cct aaa gtg 202
Pro Val Glu Phe Asn Pro Asn Phe Val Ala Arg Met Ile Pro Lys Val
20 25 30

gag tgg tcg gcg ttc ctg gag gcg rmc gat aac ttg cgt ctg atc cag 250
Glu Trp Ser Ala Phe Leu Glu Ala Xaa Asp Asn Leu Arg Leu Ile Gln
35 40 45 50 

gtg ccg aga agg gcc ggt tgagggatat gaggagaatg aggagtttct 298 
Val Pro Arg Arg Ala Gly 
55 

gaggaccatg caccacctgc tgctggaggt ggamstgaka gagggcaccc tgcagtgccc 358 

ggaatctgga cgtatgttcc ccatcagccg cgggatcccc aacatgctgc tgagtgaaga 418 

ggaaactgag agttgattgt gccaggcgcc agtttttctt gttatgactg tgtatttttg 478 

ttgatctata ccctgtttcc gaattctgcc gtgtgtatcc ccaacccttg acccaatgac 538 

accaaacaca gtgtttttga gctcggtatt atatattttt ttctcattaa aggtttaaaa 598 

ccaaaaaaaa aaaa 612



<210> 303 
<211> 539 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS
<222> 32..328

<221> sig_peptide
<222> 32..103
<223> Von Heijne matrix
score 4.59999990463257
seq FFIFCSLNTLLLG/GV

<221> polyA_signal
<222> 508..513

<221> polyA_site
<222> 528..539

<400> 303

aacaactatc ctgcctgctg cttgctgcac c atg aag tct gcc aag ctg gga 52
Met Lys Ser Ala Lys Leu Gly
-20

ttt ctt cta aga ttc ttc atc ttc tgc tca ttg aat acc ctg tta ttg 100
Phe Leu Leu Arg Phe Phe Ile Phe Cys Ser Leu Asn Thr Leu Leu Leu
-15 -10 -5

ggt ggt gtt aat aaa att gcg gag aag ata tgt gga gac ctc aaa gat 148
Gly Gly Val Asn Lys Ile Ala Glu Lys Ile Cys Gly Asp Leu Lys Asp
1 5 10 15

ccc tgc aaa ttg gac atg aat ttt gga agc tgc tat gaa gtt cac ttt 196
Pro Cys Lys Leu Asp Met Asn Phe Gly Ser Cys Tyr Glu Val His Phe
20 25 30

aga tat ttc tac aac aga acc tcc aaa aga tgt gaa act ttt gtc ttc 244
Arg Tyr Phe Tyr Asn Arg Thr Ser Lys Arg Cys Glu Thr Phe Val Phe
35 40 45

tcc agc tgt aat ggc aac ctt aac aac ttc aag ctt aaa ata gaa cgt 292
Ser Ser Cys Asn Gly Asn Leu Asn Asn Phe Lys Leu Lys Ile Glu Arg
50 55 60

gaa gta kcc tgt gtt gca aaa tac aaa cca ccg agg tgagaggatg 338
Glu Val Xaa Cys Val Ala Lys Tyr Lys Pro Pro Arg
65 70 75

tgaactcatg aagttgtctg ctgcaccatc cgaaataaag acacaagaaa attcaractg 398 

atttwgaaat ctttgttwta tttccmymak ggcgwktaag cttccatatg tttgctattt 458 

tcctgaccct agttttgtct ttcctggaaa ttaactgtat gakcattasa atgaaagagt 518 

ctttctgtca aaaaaaaaaa a 539 



<210> 304 
<211> 964 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21..527

<221> sig_peptide
<222> 21..95
<223> Von Heijne matrix
score 8.5
seq LKVLLLPLAPAAA/QD

<221> polyA_signal
<222> 921..926

<221> polyA_site
<222> 953..963

<400> 304

agggcggatc ttctccggcc atg agg aag cca gcc gct ggc ttc ctt ccc tca 53

Met Arg Lys Pro Ala Ala Gly Phe Leu Pro Ser
-25 -20 -15

i^^^ ^tc ctg cct ctg gca cct gcc gca gcc cag gat 101 

Leu Leu Lys Val Leu Leu Leu Pro Leu Ala Pro Ala Ala Ala Gln Asp
-10 -5 1

tcg act cag gcc tcc act cca ggc agc cct ctc tct cct acc gaa tac 149
Ser Thr Gln Ala Ser Thr Pro Gly Ser Pro Leu Ser Pro Thr Glu Tyr
5 10 15

caa cgc ttc ttc gca ctg ctg act cca acc tgg aag gca gar act acc 197
Gln Arg Phe Phe Ala Leu Leu Thr Pro Thr Trp Lys Ala Glu Thr Thr
20 25 30

tgc cgt ctc cgt gca acc cac ggc tgc cgg aat ccc aca ctc gtc cag 245
Cys Arg Leu Arg Ala Thr His Gly Cys Arg Asn Pro Thr Leu Val Gln
35 40 45 50

ctg gac caa tat gaa aac cac ggc tta gtg ccc gat ggt gct gtc tgc 293
Leu Asp Gln Tyr Glu Asn His Gly Leu Val Pro Asp Gly Ala Val Cys
55 60 65

tcc aac ctc cct tat gcc tcc tgg ttt gag tct ttc tgc cag ttc act 341
Ser Asn Leu Pro Tyr Ala Ser Trp Phe Glu Ser Phe Cys Gln Phe Thr
70 75 80

cac tac cgt tgc tcc aac cac gtc tac tat gcc aag aga gtc ctg tgt 389
His Tyr Arg Cys Ser Asn His Val Tyr Tyr Ala Lys Arg Val Leu Cys
85 90 95

tcc cag cca gtc tct att ctc tcw cct aac act ctc aag gag ata gaa 437
Ser Gln Pro Val Ser Ile Leu Ser Pro Asn Thr Leu Lys Glu Ile Glu
100 105 110

sct tca gct gaa gtc tca ccc acc aca gat gac ctc ccc cat ctc acc 485
Xaa Ser Ala Glu Val Ser Pro Thr Thr Asp Asp Leu Pro His Leu Thr
115 120 125 130

cca ctt cac agt gac aga acg cca gac ctt cca gcc ctg gcc 527
Pro Leu His Ser Asp Arg Thr Pro Asp Leu Pro Ala Leu Ala
135 140

tgagaggctc agcaacaacg tggaagagct cctacaatcc tccttgtccc tgggaggcca 587

ggagcaagcg ccagagcaca agcaggagca aggagtggag cacaggcagg agccgacaca 647

agaacacaag caggaagagg ggcagaaaca ggaagagcaa gaagaggaac aggaagagga 707

gggaaagcag gaagaaggac aggggactaa ggagggacgg gaggctgtgt ctcagctgca 767

gacagactca gagcccaagt ttcactctga atctctatct tctaaccctt cctcttttgc 827

tccccgggta cganaagtag agtctactcc tatgataatg gagaacatcc aggagctcat 887

tcgatcagcc caggaaatag atgaaatgaa tgaaatatat gatgagaact cctactggag 947

aaaccaaaaa aaaaaak 964



<210> 305 

<211> 684 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 147..647

<221> sig_peptide
<222> 147..374
<223> Von Heijne matrix
score 3.5
seq LASASELPLGSRP/AP

<221> polyA_site
<222> 668..681

<400> 305

Ijkaettcctgt gageceggeg gtgacaacgg caacatggce cgtgaacgga gctgaagteg 60 
likcgaettctc ctrgrarmcc ecgactgagg eggagacgaa ggtgctgcag gcgcgacggg 120 
agcggcaaga tcgcatctcc cggctc atg ggc gac tat ctg ctg cgc ggt tac 173
Met Gly Asp Tyr Leu Leu Arg Gly Tyr
-75 -70

cgc atg ctg ggc gag acg tgt gcg gac tgc ggg acg atc ctc ctc caa 221
Arg Met Leu Gly Glu Thr Cys Ala Asp Cys Gly Thr Ile Leu Leu Gln
-65 -60 -55

gac aaa cag cgg aaa atc tac tgc gtg gct tgt cag gaa ctc gac tca 269
Asp Lys Gln Arg Lys Ile Tyr Cys Val Ala Cys Gln Glu Leu Asp Ser
-50 -45 -40

gac gtg gat aaa gat aat ccc gct ctg aat gcc cag gct gcc ctc tcc 317
Asp Val Asp Lys Asp Asn Pro Ala Leu Asn Ala Gln Ala Ala Leu Ser
-35 -30 -25 -20

caa gct cgg gag cac cag ctg gcc tca gcc tca gag ctc ccc ctg ggc 365
Gln Ala Arg Glu His Gln Leu Ala Ser Ala Ser Glu Leu Pro Leu Gly
-15 -10 -5

tct cga cct gcg ccc caa ccc cca gta cct cgt ccg gag cac tgt gag 413
Ser Arg Pro Ala Pro Gln Pro Pro Val Pro Arg Pro Glu His Cys Glu
1 5 10

gga gct gca gca gga ctc aag gca gcc cag ggg cca cct gct cct gct 461
Gly Ala Ala Ala Gly Leu Lys Ala Ala Gln Gly Pro Pro Ala Pro Ala
15 20 25

gtg cct cca aat aca rat gtc atg gcc tgc aca cag aca gcc ctc ttg 509
Val Pro Pro Asn Thr Xaa Val Met Ala Cys Thr Gln Thr Ala Leu Leu
30 35 40 45

caa aag ctg acc tgg gcc tct gct gaa ctg ggc tct anc acc tcc cyg 557
Gln Lys Leu Thr Trp Ala Ser Ala Glu Leu Gly Ser Xaa Thr Ser Xaa
50 55 60

gga aaa mta gca tcc agc tgt gtg gcc tta tcc gcg cat gtg cgg agg 605
Gly Lys Xaa Ala Ser Ser Cys Val Ala Leu Ser Ala His Val Arg Arg
65 70 75

ccc tgc gca gcc tgc agc agc tac agc act aag aga agc ccc 647
Pro Cys Ala Ala Cys Ser Ser Tyr Ser Thr Lys Arg Ser Pro
80 85 90

tgagaaaaac ctctagaaaa acaaaaaaaa aaaaccc 684



<210> 306 
<211> 693 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 262..471

<221> sig_peptide
<222> 262..306
<223> Von Heijne matrix
score 3.5
seq LCFLLPHHRLQEA/RQ

<221> polyA_signal
<222> 663..668

<221> polyA_site
<222> 682..693

<400> 306 

atttcgcggc gctcgcbgma cyhsgwtgtt cagcaccttc ggtccggttg aggttgtcaa 60
gtcggmccaa acaggttgtt tctctgcagt ttccaacatg gcagggmsgt ttaatagaca 120
tggataagaa gtccactcac agaaatcctg aagatgccag ggctggcaaa tatgaaggta 180
aacacaaacg aaagaaaaga agaaagcaaa accaaaacca gcaccgatcc cgacatagat 240



cagtgacgtc tttttcttca g atg atc cta tgt ttc ctt ctt cct cat cat 291
Met Ile Leu Cys Phe Leu Leu Pro His His
-15 -10

cgt ctt cag gaa gcc aga cag att caa gta ttg aag atg ctt cca agg 339
Arg Leu Gln Glu Ala Arg Gln Ile Gln Val Leu Lys Met Leu Pro Arg
-5 1 5 10

gaa aaa tta aga aga aga gaa gag aga aaa caa ata aat ggg aaa aaa 387
Glu Lys Leu Arg Arg Arg Glu Glu Arg Lys Gln Ile Asn Gly Lys Lys
15 20 25

raa agg aca aaa tat gaa aca cca aga aaa rga raa gga aaa aaa gga 435
Xaa Arg Thr Lys Tyr Glu Thr Pro Arg Lys Xaa Xaa Gly Lys Lys Gly
30 35 40

gga aac mac cmc wtw tkt cmc ctt tcc aar agg gac tgaaactggg 481
Gly Asn Xaa Xaa Xaa Xaa Xaa Leu Ser Lys Arg Asp
45 50 55

ctgacccttt tgatttccaa vctcascgtt ttggtgtaag gcggccaaar aaggatgcgg 541 

ascccagcac tgtgaagcct acaaaaacat tgatgcgctg gcttggggat ttgaatttga 601 

acatctttca cactaagttc agactcatga aaccaatctt cagatgctct gtaaaccaca 661 

taataaagag tttggaaatt aaaaaaaaar aa 693 



<210> 307 

<211> 1656 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 74..1216

<221> sig_peptide
<222> 74..172
<223> Von Heijne matrix
score 5.80000019073486
seq XLCLGMALCPRQA/TR

<221> polyA_signal
<222> 1627..1632

<221> polyA_site
<222> 1640..1652

<400> 307

l.ln^^^t^tt^^c gtctcaacgt tcggatcagc agcttttttc cattctctct ctccacttct 60 
|tcagtgagca gcc atg agt tgg act gtg cct gtt gtg egg gcc age cag 109 
rf J^et Ser Trp Thr Val Pro Val Val Arg Ala Ser Gin 

n -30 -25 

';;^ga gtg age teg gtg gga 
t^;|Arg Val Ser Ser Val Gly 

m -20 

tgt ceg cgt caa gca acg 
Cys Pro Arg Gin Ala Thr 
-5 1 
acc ecc gtg age aag atg 
Thr Pro Val Ser Lys Met 
15 

ttc act tec gar aag ecc 
Phe Thr Ser Glu Lys Pro 
30 

act gga teg gtg ggc atg 
Thr Gly Ser Val Gly Met 
45 

ttg agt gat gaa ctt gee 
Leu Ser Asp Glu Leu Ala 
60 65 
9gt gag acr atg gat ctt 
Gly Glu Thr Met Asp Leu 
80 

aat att gtt tgt age aaa 



geg aat kte eta 
Ala Asn Xaa Leu 
-15 

cge ate ccg etc 
Arg lie Pro Leu 
5 

geg act gtg aar 
Ala Thr Val Lys 
20 

gtt eat eac agt 
Val His His Ser 
35 

gcc tgc get ate 
Ala Cys Ala He 
50 

ctt gtg gat ctt 
Leu Val Asp Leu 

caa cat ggc age 
Gin His Gly Ser 
85 

rat tac ttt gtc 



tgc ctg ggg atg 
Cys Leu Gly Met 
-10 

aac ggc ace tgg 
Asn Gly Thr Trp 

agt gag ctt att 
Ser Glu Leu He 
25 

aag gtc tec ate 
Lys Val Ser lie 
40 

age ate tta tta 
Ser He Leu Leu 
55 

gat gaa rac aaa 
Asp Glu Xaa Lys 
70 

cct ttc acg aaa 
Pro Phe Thr Lys 

aea gca aac tec 



gee ctg 157 
Ala Leu 

etc ttc 205 
Leu Phe 
10 

gag cgt 253 
Glu Arg 

ata gga 3 01 

He Gly 

aaa ggc 349 
Lys Gly 

ctg aag 397 
Leu Lys 
75 

atg cca 445 
Met Pro 
90 

aac eta 493 
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Asn 


xxe 


V ai 


Cys 


Ser 


Lys 


Aaa 


Tyr 


Fne 


Val 


Thr 


Ala 


Asn 


Ser 


Asn 
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a -t- +- 


dUC 
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cge 


eaa 
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aag 
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gaa 


acg 


cge 


ett 


541 




X X C 


Tl fa 
X X c: 


X lix 


Al a 
r\Xd 


\jxy 


a 1 a 
AXd 


Arg 


ni n 

oin 


Aaa 


Lys 


r*l ir 

vjiy 


CjIU 


Thr 


Arg 


Leu 








Tin 










llo 
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U Cd 


s tc 


eag 


cga 


-J --3 -H 

aau 


gtg 


gcc 


ate 


ttc 


aag 


tta 


atg 


att 


tee 


agt 


589 


Asn 


Leu 


Ya a 

Aaa 


oin 


Arg 


Asn 


vai 


Ala 


Tl ^ 

lie 


Phe 


Lys 


Leu 


Met 


He 


Ser 


Ser 






125 










130 










135 














gtc 


cag 


uae 


age 


eec 


cac 


tge 


aaa 


ctg 


att 


att 


gtt 


tec 


aat 


eca 


637 


U.6 


vai 


vjin 


Tyr 


Ser 


Pro 


His 


Cys 


Lys 


Leu 


He 


He 


Val 


Ser 


Asn 


Pro 




140 










145 










150 










155 




gtg 


gau 


ate 


4- 4- ^ 

uta 


acc 


4- +- 

tat 


gta 


get 


tgg 


aag 


ttg 


agt 


gea 


ttt 


eec 


aaa 


685 


va.± 


Asp 


lie 


Leu 


Thr 


Tyr 


Val 


Ala 


Trp 


Lys 


Leu 


Ser 


Ala 


Phe 


Pro 


Lys 












lo 0 










165 










170 






a a o 


cgt 


a -h-h 
dt. L. 


a -h -1- 


gga 


age 


ggc 


tgt 


aat 


Ctg 


ata 


rung 


get 


cgt 


ttt 


cgt 


733 


Asn 


Arg 


lie 


lie 


ijiy 


Ser 


Cjiy 


Cys 


Asn 


Leu 


He 


Xaa 


Ala 


Arg 


Phe 


Arg 










175 










180 










185 










ucg 


auu 


99^ 


caa 


aag 


ett 


ggt 


ate 


eat 


tct 


gaa 


age 


tge 


eat 


gga 


781 


Jb'ne 


Leu 


lie 


Gly 


ril « 

Gin 


Lys 


Leu 


Gly 


He 


His 


Ser 


Glu 


Ser 


Cys 


His 


Gly 








190 










195 










200 










tgg 


ate 


etc 


gga 


gag 


eat 


gga 


gae 


tea 


agt 


gtt 


ect 


gtg 


tgg 


agt 


gga 


829 




lie 


Leu 


Gly 


Glu 


His 


Gly 


Asp 


Ser 


Ser 


Val 


Pro 


Val 


Trp 


Ser 


Gly 






2 05 










210 










215 














aac 


ata 


get 


ggt 


gte 


ect 


ttg 


aag 


gat 


ctg 


aac 


tct 


gat 


ata 


gga 


877 


va.x 


Asn 
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TV 1 

Ala 


Gly 


Vai 


Pro 


Leu 


Lys 


Asp 


Leu 


Asn 


Ser 


Asp 


He 


Gly 




"''i22 0 
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235 




^ ^act 


gat 


aaa 


gat 
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cae 
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Lys 
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Glu 


Gin 


Trp 


Lys 
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Val 
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Lys 


Glu 


Val 


Thr 
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tat 


gag 


att 


att 
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atg 
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tat 


act 


tet 


tgg 


gcc 


973 


A±a 


Thir 


Ala 


Tyr 


Glu 


Tl ^ 

lie 


He 


Lys 
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Lys 


Gly 


Tyr 


Thr 
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Trp 


Ala 
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^ -i- 4- 
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tct 


gtg 
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gat 


tta 
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gaa 


agt 


att 


ttg 


aag 


aat 


ett 


1021 






Leu 


Ser 


Vai 


Ala 


Asp 


Leu 


Thr 


Glu 


Ser 


He 


Leu 


Lys 


Asn 


Leu 
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275 










280 
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a -h ^ 

ana 


/-1 1 4- 

eat 


cea 


gtt 


tec 


ace 


ata 


act 


aag 


ggc 


etc 


tat 


gga 


ata 


1069 


w^Airg 


Airg 


lie 


His 


Pro 


Val 


Ser 


Thr 


He 


Thr 


Lys 


Gly 


Leu 


Tyr 


Gly 


He 




'-ill' 


o o c 
Zoo 
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irau 


gaa 


gaa 


gta 


4- 4- ^ 

ute 


etc 


agt 


att 


ect 


tgt 


ate 


ctg 


gga 


gag 


aac 


ggt 


1117 


Add 




r*l ^l^ 
ValU 


vai 


Fne 


Leu 


Ser 


He 


Pro 


Cys 


He 


Leu 


Gly 


Glu 


Asn 


Gly 




"3 n n 
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310 










315 




att 


acc 


aac 


ett 


ata 


aag 


ata 


aag 


ctg 


acc 


cet 


gaa 


gaa 


gag 


gee 


eat 


1165 


lie 


Thr 


Asn 


Leu 


He 


Lys 


He 


Lys 


Leu 


Thr 


Pro 


Glu 


Glu 


Glu 


Ala 


His 












320 










325 










330 






ctg 


aaa 


aaa 


agt 


gca 


aaa 


aca 


etc 


tgg 


gaa 


att 


cag 


aat 


aag 


ett 


aag 


1213 


Leu 


Lys 


Lys 


Ser 


Ala 


Lys 


Thr 


Leu 


Trp 


Glu 


He 


Gin 


Asn 


Lys 


Leu 


Lys 










335 










340 










345 







ctt taaagttgcc taaaactacc attccgaaat tattgaagag atcatagata 1266
Leu

caggattata taacgaaatt ttgaataaac ttgaattcct aaaagatgga aacaggaaag 1326
taggtagagt gattttccta tttatttagt cctccagctc ttttattgag catccacgtg 1386
ctggacgata cttatttaca attcckaagt atttttggta cctctgatgt agcagcactt 1446
gccatgttat atatatgtag ttgrtnatttg gttcccaaaa agtaggatgt aggtatttat 1506
tgtgttctag aaattccgac tcttttcatt agatatatgc tatttctttc attcttgctg 1566
gtttatacct atgttcattt atatgctgta aaaaagtagt agcttcttct acaatgtaaa 1626
aataaatgta catacaaaaa aaaaaamcmc 1656
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<210> 308 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 48..164

<221> sig_peptide
<222> 48..89
<223> Von Heijne matrix
score 4
seq YYMVCLFFRLIFS/EH

<221> polyA_signal
<222> 482..487

<221> polyA_site
<222> 505..517

<400> 308

^%ggagatagc ctcgtagaaa tgacaaccac aatgttaata ctaacat atg tat tac 56 

Met Tyr Tyr 

atg gtt tgt ttg ttc ttt cgc tta ata ttt tca gag cac cta cct att 104
Met Val Cys Leu Phe Phe Arg Leu Ile Phe Ser Glu His Leu Pro Ile
-10 -5 1 5

ata ggc act gtc act tct cac aaa act ggg aca cta act gtt tat cca 152
Ile Gly Thr Val Thr Ser His Lys Thr Gly Thr Leu Thr Val Tyr Pro
10 15 20

aca tct gct ggc taaataaaga catgatcttc accttttggg attgttaatt 204
Thr Ser Ala Gly

'^%aaaatggtt ccataagagc aatgcaaaga cagagatatt tggcagcact gcagctggtg 264 
JJ.ktttatatgg ctcttcacaa ggtgttattt tggggtatca aggtatggat gcttaaatca 324 
Hfectgcaggaa gtaagaaaga agaaaaaagg agtgataaag ataaaaaaaa ateaaccttg 384 
gtccttccac caaaacccat taatttccat atcatcatct gcataararg gaaaattcct 444
acwtgaccag gttactgcaa ggatktkaat tttgaatatt aaaatattat mcmcaattgg 504
aaaaaaaaaa aaa 517



<210> 309 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 185..334

<221> sig_peptide
<222> 185..295
<223> Von Heijne matrix
score 5.90000009536743
seq LSYASSALSPCLT/AP

<221> polyA_signal
<222> 355..360

<221> polyA_site
<222> 392..405



<400> 309 

atcaccttct tctccatcct tstctgggcc agtccccarc ccagtccctc tcctgacctg 60 
cccagcccaa gtcagccttc agcacgcgct tttctgcaca cagatattcc aggcctacct 120 
ggcattccag gacctccgma atgatgctcc agtcccttac aagcgcttcc tggatgaggg 180 
tggc atg gtg ctg acc acc ctc ccc ttg ccc tct gcc aac agc cct gtg 229
Met Val Leu Thr Thr Leu Pro Leu Pro Ser Ala Asn Ser Pro Val
-35 -30 -25

aac atg ccc acc act ggc ccc aac agc ctg agt tat gct agc tct gcc 277
Asn Met Pro Thr Thr Gly Pro Asn Ser Leu Ser Tyr Ala Ser Ser Ala
-20 -15 -10

ctg tcc ccc tgt ctg acc gct cca aag tcc ccc cga ctt gct atg atg 325
Leu Ser Pro Cys Leu Thr Ala Pro Lys Ser Pro Arg Leu Ala Met Met
-5 1 5 10

cct gac aac taaatatcct tatccaaatc aataaarwra raatcctccc 374
Pro Asp Asn

,;|'|:ccaraaggg tttctaaaaa caaaaaaaaa a 405 



<210> 310
<211> 1087
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 195..347

<221> sig_peptide
<222> 195..272
<223> Von Heijne matrix
score 7.09999990463257
seq LASLQWSLTLAWC/GS

<221> polyA_signal
<222> 1037..1042

<221> polyA_site
<222> 1071..1082



<400> 310 

aaagtgtaga acacggacct ctgagttatg ctcttgagag gtgccaaagc tgggctgttt 60
acctacctta tccacagagc tctgaaagtc aagccagaaa ggaaggattc caaattcttg 120
gaattttatc tagaaaagaa gactaagcag cttttgttct tctgtgaccc agttgctggc 180
ccaagacatg gaca atg acc ccc tgg tgt ttg gcg tgt ctg ggg agg agg 230
Met Thr Pro Trp Cys Leu Ala Cys Leu Gly Arg Arg
-25 -20 -15

cct ctc gct tct ttg cag tgg agc ctg aca ctg gcg tgg tgt ggc tcc 278
Pro Leu Ala Ser Leu Gln Trp Ser Leu Thr Leu Ala Trp Cys Gly Ser
-10 -5 1

ggc agc cac tgg aca gag aga cca akt cag akt tca ccg tgg akt tct 326
Gly Ser His Trp Thr Glu Arg Pro Xaa Gln Xaa Ser Pro Trp Xaa Ser
5 10 15

ctg tca gcg acc acc agg ggg tgatcacacg gaaggtgaac atccaggtcg 377
Leu Ser Ala Thr Thr Arg Gly
20 25

gggatgtgaa tgacaacgcg cccacatttc acaatcagcc ctacagcgtc cgcatccctg 437 

araatacacc agtggggacg cccatcttca tcgtgaatgc cacagacccc gacttggggg 497 

cagggggcag cgtcctctac tccttccagc ccccctccca attcttcgcc attgacagcg 557 

cccgcggtat cktcacagtg atccgggagc tggactacga taccacrcmg gcctaccagc 617 

tcwcggtcwa cgccacagat caagacaara ccaggcctct gtccaccstg gccaacttgg 677 

ccatcatcat cacagatgtc caggacatgg accccatctt catcaacctg ccttacagca 737 

ccaacatcta cgagcattct cctccgggca cgacggtgcg catcatcacc gccatagacc 797 

aggataaagg acgtccccgg ggcattggct acaccatcgt ttcagggcat ctgtgtttac 857 

aagaacccaa gatctctcag gagctcagga aaaggggctt gctgtgaggc tcagggttcc 917 

catggacatt ctgagctgac cctcctcagc attggatctc ctggctcagg aactaggaac 977 

gaagcttgga tgttttctcc tttcctacag catctgtatt catttcctat agttgccata 1037 

ataaaatgcc actaacttag tggcttaaaa accaaaaaaa aaaaaccctt 1087 



<210> 311
<211> 916
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 90..815

<221> sig_peptide
<222> 90..179
<223> Von Heijne matrix
score 13.1999998092651
seq LLLLSTLVIPSAA/AP

<221> polyA_signal
<222> 883..888

<221> polyA_site
<222> 905..916

<400> 311 

aaaacagtac gtgggcggcc ggaatccggg agtccggtga cccgggctgt ggtctagcat 60 
aaaggcggag ccagaagaag gggcggggt atg gga gaa gcc tcc cca cct gcc 113
Met Gly Glu Ala Ser Pro Pro Ala
-30 -25

ccc gca agg cgg cat ctg ctg gtc ctg ctg ctg ctc ctc tct acc ctg 161
Pro Ala Arg Arg His Leu Leu Val Leu Leu Leu Leu Leu Ser Thr Leu
-20 -15 -10

gtg atc ccc tcc gct gca gct cct atc cat gat gct gac gcc caa gag 209
Val Ile Pro Ser Ala Ala Ala Pro Ile His Asp Ala Asp Ala Gln Glu
-5 1 5 10

agc tcc ttg ggt ctc aca ggc ctc cag agc cta ctc caa ggc ttc agc 257
Ser Ser Leu Gly Leu Thr Gly Leu Gln Ser Leu Leu Gln Gly Phe Ser
15 20 25
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C uC 


4- 4- ^ 

u uC 


etg 


aaa 
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etg 
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ggc 


ata 
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tta 


ttc 


305 


Arg 


Leu 


Phe 


Leu 


Lys 


Gly 


Asn 


Leu 


Leu 


Arg 


Gly He 


Asp 


Ser 


Leu 


Phe 










30 










35 










40 










gcc 


CCC 


acg 


gac 


ttc 


egg 


ggc 


etc 


ect 


ggg 


aac 


tac 


cac 


aaa 


gag 


353 


Ser 


Ala 


Pro 


Met 


Asp 


Phe 


Arg 


Gly 


Leu 


Pro 


Gly Asn 


Tyr 


His 


Lys 


Glu 


















50 










55 










gag 


aac 


cag 


gag 


cac 


cag 


etg 


ggg 


aac 


aac 


acc 


etc 


tec 


age 


cac 


etc 


401 




Asn 

b u 


vjxn 


blU 


ms 


Gin 


Leu 
ob 


Gly 


Asn 


Asn 


Thr 


Leu 
70 


Ser 


Ser 


His 


Leu 




cag 


auc 


gac 


aag 


atg 


acc 


gac 


aac 


aag 


aca 


gga 


gag 


gtg 


etg 


ate 


tec 


449 


Gin 


He 


Asp 


Lys 


Met 


Thr 


Asp 


Asn 


Lys 


Thr 


Gly Glu 


Val 


Leu 


He 


Ser 




f b 










80 










85 










90 




gag 


aat 


gtg 


gtg 


gca 


tec 


att 


eaa 


cca 


vcg 


gag 


ggg 


anc 


ttc 


gag 


ggt 


497 


Glu 


Asn 


Val 


Val 


Ala 


Ser 


He 


Gin 


Pro 


Xaa 


Glu Gly 


Xaa 


Phe 


Glu 


Gly 












95 










100 










105 






gac 


ucg 


aag 


gtii 


CCC 


agg 


atg 


gag 


gar 


aag 


gag 


gee 


etg 


gta 


CCC 


mte 


545 


Asp 


Leu 


Lys 


Val 
110 


Pro 


Arg 


Met 


Glu 


Glu 
115 


Lys 


Glu 


Ala 


Leu 


Val 
120 


Pro 


Xaa 




car 


aag 


gcc 


acg 


gac 


age 


ttc 


cac 


aca 


gaa 


etc 


eat 


CCC 


egg 


gtg 


gcc 


593 


Gin 


Lys 


TV 1 ^ 

Ala 
125 


Thr 


Asp 


Ser 


Phe 


His 
130 


Thr 


Glu 


Leu 


His 


Pro 
135 


Arg 


Val 


Ala 






tgg 


ate 


att 


aag 


etg 


cca 


egg 


egg 


agg 


tee 


cae 


cag 


gat 


gcc 


etg 


641 




Trp 


He 


He 


Lys 


Leu 


Pro 


Arg 


Arg 


Arg 


Ser 


His 


Gin 


Asp 


Ala 


Leu 
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145 










150 












n^ag 


ggc 


ggc 


cac 


tgg 


etc 


anc 


gar 


aag 


cga 


cac 


cgc 


etg 


cag 


gee 


ate 


689 


i'"telu 


Gly 


Gly 


His 


Trp 


Leu 


Xaa 


Glu 


Lys 


Arg 


His 


Arg 


Leu 


Gin 


Ala 


He 
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165 
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gat 


gga 


etc 


cgc 


aag 


ggg 


ace 


cae 


aag 


gac 


rte 


eta 


daa 


rag 


ggg 


737 


ijf^rg 


Asp 


Gly 


Leu 


Arg 


Lys 


Gly 


Thr 


His 


Lys 


Asp 


Xaa 


Leu 


Xaa 


Xaa 


Gly 












175 










180 










185 






acc 


gar 


age 


tec 


tec 


cac 


tec 


agg 


etg 


tec 


CCC 


cga 


aar 


amm 


cac 


tta 


785 


Thr 


Glu 


Ser 


Ser 


Ser 


His 


Ser 


Arg 


Leu 


Ser 


Pro 


Arg 


Lys 


Xaa 


His 


Leu 
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195 










200 








i^rbtg 


tac 


ate 


etc 


arg 


CCC 


tet 


egg 


cag 


etg 


targggtggg gaceggggar 


835 


N^eu 


Tyr 


He 


Leu 


Xaa 


Pro 


Ser 


Arg 


Gin 


Leu 




















205 










210 





















^llF^^^tgectg tagceeecat earaccetgc cccaagcace atatggaaat aaagttcttt 895 
cttacatcca aaaaaaaaaa a 916



<210> 312 
<211> 583 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 52..513

<221> sig_peptide
<222> 52..231
<223> Von Heijne matrix
score 4
seq LVRRTLLVAALRA/WM

<221> polyA_signal
<222> 553..558

<221> polyA_site
<222> 572..583

<400> 312 

aaggaaacag caaccagagg gagatgatca cctgaaccac tgctccaaac c atg ggc 57 

Met Gly 
-60 



agt 


aaa 


tgc 


tgt 


aaa 


ggt 


ggt 


eea 


gat 


gaa 


gat 


gca 


gta 


gaa 


aga 


cag 


105 


Ser 


Lys 


Cys 


Cys 


Lys 


Gly 


Gly 


Pro 


Asp 


Glu 


Asp 


Ala 


Val 


Glu 


Arg 


Gin 










-55 










-50 










-45 








agg 


egg 


cag 


aag 


ttg 


ett 


ett 


gca 


eaa 


etg 


cat 


cac 


aga 


aaa 


agg 


gtg 


153 


Arg 


Arg 


Gin 


Lys 


Leu 


Leu 


Leu 


Ala 


Gin 


Leu 


His 


His 


Arg 


Lys 


Arg 


Val 








-40 










-35 










-30 










aar 


gca 


get 


ggg 


cag 


ate 


cag 


gee 


tgg 


tgg 


egt 


ggg 


gtc 


etg 


gtg 


cgc 


201 


Lys 


Ala 


Ala 


Gly 


Gin 


He 


Gin 


Ala 


Trp 


Trp 


Arg 


Gly 


Val 


Leu 


Val 


Arg 






-25 










-20 










-15 
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acc 


etg 


etg 


gtt 


get 


gee 


etc 


agg 


gee 


tgg 


atg 


att 


cag 


tge 


tgg 


249 


Arg 


Thr 


Leu 


Leu 


Val 


Ala 


Ala 


Leu 


Arg 


Ala 


Trp 


Met 


He 


Gin 


Cys 


Trp 




-10 










-5 










1 








5 






agg 


aeg 


ttg 


gtg 


cag 


aga 


egg 


ate 


egt 


cag 


egg 


egg 


cag 


gee 


etg 


297 




Arg 


Thr 


Leu 


Val 


Gin 


Arg 


Arg 


He 


Arg 


Gin 


Arg 


Arg 


Gin 


Ala 


Leu 










10 










15 










20 








flbtr 


ggg 


gtc 


tac 


gte 


ate 


cag 


gag 


cag 


gcg 


gcg 


gtc 


aag 


etc 


cag 


tec 


345 




Gly 


Val 


Tyr 


Val 


He 


Gin 


Glu 


Gin 


Ala 


Ala 


Val 


Lys 


Leu 


Gin 


Ser 








25 










30 










35 












ate 


cgc 


atg 


tgg 


cag 


tge 


egg 


eaa 


tgt 


tac 


cgc 


eaa 


atg 


tge 


aat 


3 93 




He 


Arg 


Met 


Trp 


Gin 


Cys 


Arg 


Gin 


Cys 


Tyr 


Arg 


Gin 


Met 


Cys 


Asn 




40 










45 










50 












'"^ get 


etc 


tge 


ttg 


ttc 


cag 


gtc 


eea 


aaa 


age 


age 


ett 


gee 


ttc 


eaa 


act 


441 


Ala 


Leu 


Cys 


Leu 


Phe 


Gin 


Val 


Pro 


Lys 


Ser 


Ser 


Leu 


Ala 


Phe 


Gin 


Thr 














60 










65 










70 




!^:,bat 


ggc 


ttt 


tta 


cag 


gtc 


eaa 


tat 


gca 


ate 


cet 


tea 


aag 


cag 


eea 


gag 


489 


H^sp 


Gly 


Phe 


Leu 


Gin 


Val 


Gin 


Tyr 


Ala 


He 


Pro 


Ser 


Lys 


Gin 


Pro 


Glu 












75 










80 










85 








cac 


att 


gaa 


ate 


eta 


tea 


ate 


tgaaaggcct ggggeatgga gaacaggctg 


543 




His 


He 


Glu 


He 


Leu 


Ser 


He 





















90 



cactacccta ataaatgtct gaccaggtaa aaaaaaaaaa 583



<210> 313 
<211> 697 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 172..438

<221> sig_peptide
<222> 172..354
<223> Von Heijne matrix
score 4.69999980926514
seq LLPCNLHCSWLHS/SP

<221> polyA_signal
<222> 682..687

<221> polyA_site
<222> 685..697



<400> 313 

agattggctg ggcagatggg ctgactggct gggcagatgg gtgggtgagt tccctctccc 60 
cagagccatc ggccaggtac caaagctcag ctgtatggat tcccaacagg aggacctgcg 120
cttccctggg acccattgtt gtactggatt aacaagcgac ggcgctacgg c atg aat 177 

Met Asn 
-60 

gca gcc atc aac acg ggc cct gcc cct gct gtc acc aag act gag act 225
Ala Ala Ile Asn Thr Gly Pro Ala Pro Ala Val Thr Lys Thr Glu Thr
-55 -50 -45

gag gtc cag aat cca gat gtt ctg tgg gat ttg gac atc ccc gaa gcc 273
Glu Val Gln Asn Pro Asp Val Leu Trp Asp Leu Asp Ile Pro Glu Ala
-40 -35 -30

^99 cat get gac eaa gae age aae ccc aag geg gaa gee ctg etc 321 

Arg Ser His Ala Asp Gln Asp Ser Asn Pro Lys Ala Glu Ala Leu Leu
-25 -20 -15

'^f ce tgc aac ctg eac tge age tgg etc cac age age ccc agg cca gat 369 
Pro Cys Asn Leu His Cys Ser Trp Leu His Ser Ser Pro Arg Pro Asp
-10 -5 1 5

-k;cc eat tec eac tte cca tet kte agg agg tge cct ttg eee cac cct 417 
Pro His Ser His Phe Pro Ser Xaa Arg Arg Cys Pro Leu Pro His Pro
10 15 20

jfgt gca ace tac eee ccs kgc tgaaceaetc tgtcteetat eetttggeea 468 
■:J:ys Ala Thr Tyr Pro Pro Xaa 

^ eetgteetga aaggaatgtt ctcttccatt ccctectgaa tctggcccag gaagaecata 528 
■%ettcaatgy caagcctttt cetteaaaac tgtagcctec tctcactgaa ggtgggaget 588 
^i&caggaatca ggtgcagagt aggaaatgga actaacctea ggaaggtggt attgacagag 648 
^^igteaggaecc aeetggatgt catgctatga aaeattaaaa gaaaaaaaa 697 



<210> 314
<211> 803
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 148..366

<221> sig_peptide
<222> 148..225
<223> Von Heijne matrix
score 5.5
seq LFTLLFLIMLVLK/LD

<221> polyA_signal
<222> 770..775

<221> polyA_site
<222> 792..803



<400> 314 

aaatgggggg aaaagggcgg aaaaggacaa ggatccaaac tggcgaattt gctgatcttc 60 
gcgtccctct ccgctttccg gccggcagcg ctgccagggt atatttcctt ttttccgatc 120 
ctgcaacagc ctctttaaac tgtttaa atg aga atg tec ttg get cag aga gta 174 

Met Arg Met Ser Leu Ala Gin Arg Val 
-25 -20 



eta 


ete 


ace 


tgg 


ett 


ttc 


aca 


eta 


etc 


ttc 


ttg 


ate 


atg 


ttg gtg 


ttg 


222 
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Leu 


Thr 
-15 


Trp 


Leu 


Phe 


Thr 


Leu 
-10 


Leu 
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Leu 


He 


Met 
-5 


Leu 


Val 


Leu 
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etg 


gat 


gag 


aaa 


gea 


cet 


tgg 


aac 


tgg 


ttc 


etc 


ata 


ttc 


att 


cca 


270 


Lys 


Leu 
1 


Asp 


Glu 


Lys 


Ala 
5 


Pro 


Trp 


Asn 


Trp 


Phe 
10 


Leu 


He 


Phe 


He 


Pro 
15 




gtc 


tgg 


ata 


ttt 


gat 


act 


ate 


ctt 


ett 


gte 


etg 


etg 


att 


gtg 


aaa 


atg 


318 


Val 


Trp 


He 


Phe 


Asp 
20 


Thr 


He 


Leu 


Leu 


Val 
25 


Leu 


Leu 


He 


Val 


Lys 
30 


Met 




get 


ggg 


egg 


tgt 


aag 


tct 


ggc 


ttt 


gae 


etc 


gae 


atg 


gat 


cae 


aca 


ata 


366 


Ala 


Gly 


Arg 


Cys 


Lys 


Ser 


Gly 


Phe 


Asp 


Leu 


Asp Met 


Asp 


His 


Thr 


He 










35 










40 










45 









taaaaaaaaa aacctggtac ctcattgcac tgtkacttaa attasccttc tgcctcgcac 426

pf|tctgtgctaa actggaaeag tttactacca tgaatctate etatgtctte attcetttat 486 

ji^ggcettget ggetgggget ttaacagaac teggatataa tgt ctt t ttt gtgaaagact 546 

J^'^aettctaag tacateatet cetttctatt getgttcaac aagttaccat taaagtgttc 606 

J'^tgaatetgte aagcttcaag aataecagag aaetgaggga aaataecaaa tgtagtttta 666 

f'I^^*^^^^ttcc ataaaaeagg attggtgaat caeggacttc tagtcaaect acagcttaat 726 

y tatteageat ttgagttatt gaaatcetta ttatetctat gtaaataaag tttgttttgg 786 

■^l?accteaaaaa aaaaaaa 303 



<210> 315
<211> 823
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 175..336

<221> sig_peptide
<222> 175..276
<223> Von Heijne matrix
score 3.70000004768372
seq SVLNVGHLLFSSA/CS

<221> polyA_site
<222> 812..823



<400> 315 

aaggcgcgcg cgaccggcgg ctctttggcg cggattaggg ggtctcggcg agggagtcat 60
caagctttgg tgtatgtgtt ggccggttct gaagtcttga agaagctctg ctgaggaaga 120
ccaaagcagc actcgttgcc aattagggaa tggaccgttt gggttccttt agca atg 177
Met

atc cct ctg ata agc cac ctt gcc gag gct gct cct cct acc tca tgg 225
Ile Pro Leu Ile Ser His Leu Ala Glu Ala Ala Pro Pro Thr Ser Trp
-30 -25 -20

agc ctt ata tca agt gtg ctg aat gtg ggc cac ctc ctt ttt tcc tct 273
Ser Leu Ile Ser Ser Val Leu Asn Val Gly His Leu Leu Phe Ser Ser
-15 -10 -5

gct tgc agt gtt tca ctc gag gct ttg agt aca aga aac atc aaa gcg 321
Ala Cys Ser Val Ser Leu Glu Ala Leu Ser Thr Arg Asn Ile Lys Ala
1 5 10 15

atc ata ctt atg aaa taatggcttc agattttcct gtccttgatc ccagctggac 376
Ile Ile Leu Met Lys
20

tgctcaagaa raaatggccc ttttagaasc tgtgatggac tgtggctttg gaaattggca 436
ggatgtagcc aatcaaatgt gcaccaarac caaggaggag tgtgagaagc actatatgaa 496
gcatttcatc aataacccyc tgtttgcatc trscctgctg aacctgaaac aascagrgga 556
agcaaaaact gctgacacag ccattccatt tcactctaca ratgaccctc cccgacckac 616
ctttgactcc ttgctttctc gggacatggc cgggtacwtg ccmgctcgag cagatttcat 676
tgaggaattt gacaattatg cagaatggga cttgagagac attgattttg ttgaagatga 736
ctcggacatt ttacatgctc tgaagatggc tgtggtagat atctatcatt ccaggttaaa 796
ggagagacaa agacgaaaaa aaaaaaa 823 



<210> 316 
<211> 823
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 191..553

<221> sig_peptide
<222> 191..304
<223> Von Heijne matrix
score 5.69999980926514
seq LAFLSCLAFLVLD/TQ

<221> polyA_signal
<222> 766..771

<221> polyA_site
<222> 804..817



<400> 316 

aactctgcag ggcctccaag gccaggcttc agggctggga ctcagtcctg aggcactggg 60
gagccatgag gggctgtggc agggaggggc agggtgtgga aagactcccc tggggccatg 120
gtggagatgt gctgaggtct tctccctgat cgtcttctcc tccctgctga ccgacggcta 180
ccagaackag atg gag tct ccg cag ctc cac tgc att ctc aac agc aac 229
Met Glu Ser Pro Gln Leu His Cys Ile Leu Asn Ser Asn
-35 -30

agc gtg gcc tgc agc ttt gcc gtg gga gcc ggc ttc ctg gcc ttc ctc 277
Ser Val Ala Cys Ser Phe Ala Val Gly Ala Gly Phe Leu Ala Phe Leu
-25 -20 -15 -10

agc tgc ctg gcc ttc ctc gtc ctg gac aca cag gag acc cgc att gcc 325
Ser Cys Leu Ala Phe Leu Val Leu Asp Thr Gln Glu Thr Arg Ile Ala
-5 1 5

ggc acc cgc ttc aag aca gcc ttc cag ctc ctg gac ttc atc ctg gct 373
Gly Thr Arg Phe Lys Thr Ala Phe Gln Leu Leu Asp Phe Ile Leu Ala
10 15 20

gtt ctc tgg gca gtt gtc tgg ttc atg ggt ttc tgc ttc ctg gcc aac 421
Val Leu Trp Ala Val Val Trp Phe Met Gly Phe Cys Phe Leu Ala Asn
25 30 35

caa tgg cag cat tcg ccg ccc aaa gar kkc ctc ctg ggg agc agc agt 469
Gln Trp Gln His Ser Pro Pro Lys Glu Xaa Leu Leu Gly Ser Ser Ser
40 45 50 55

gcc cag gca gcc atc ggc stt cac ctt ctt ctc cat cct tgt ctg gat 517
Ala Gln Ala Ala Ile Gly Xaa His Leu Leu Leu His Pro Cys Leu Asp
60 65 70

att cca rgc cta cct ggc akk cca gga cct ccg aaa tgatgctcca 563
Ile Pro Xaa Leu Pro Gly Xaa Pro Gly Pro Pro Lys
75 80

gtcccttacm arcgcttcct ggatgaaggt ggcatggtgs kkaacaccct ccccttgccc 623
tctgccaaca gcctgtgaac atgcccacca ctggccccaa cagcctgagt tatgctagct 683
ctgccctgtc cccctgtctg accgctcmaa agtccccccg gcttgctatg atgcctgaca 743
actaaatatc cttatccaaa tcaataaaga gagaatcctc cctccagaag ggtttctaaa 803
aacaaaaaaa aaaahncctt 823



<210> 317 
<211> 1112
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 106..603

<221> sig_peptide
<222> 106..216
<223> Von Heijne matrix
score 4.30000019073486
seq LWEKLTLLSPGIA/VT

<221> polyA_site
<222> 1102..1112

<400> 317 

agcgattgcg aatcctccgc tgaggtgatt tggatatccc tagaacgttg agggcacgag 60
tcgggtcctg agaccaggtc ctcagccagc agagccacgt tcctt atg agc acc gtg 117



Met Ser Thr Val 
-35 
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ttt 


cat 


ttt 


ect 


aca 


eca 


ctg 


ace 


cga 


ata 


tgc 


ccg 


gcg 


eca 
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Gly 
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Thr 


Arg 
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Pro 


Ala 


Pro 
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ata 
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Trp 


Gly 


Leu 


Arg 


Leu 


Trp 


Glu 


Lys 


Leu 


Thr 


Leu 


Leu 


Ser 


Pro 


Gly 


He 
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get 


gtc 


act 


ccg 


gtc 


eag 


atg 


gca 


ggc 


aag 


aag 


gac 


tac 


ect 


gca 


ctg 


261 


Ala 


Val 


Thr 


Pro 


Val 
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Met 


Ala 


Gly 
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Lys 


Asp 
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Pro 


Ala 


Leu 
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5 










10 










15 




ctt 


tee 


ttg 


gat 


gag 


aat 


gaa 


etc 


gaa 


gag 


cag 


ttt 


gtg 


aaa 


gga 


eac 


309 


Leu 


Ser 


Leu 


Asp 


Glu 
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Glu 


Leu 


Glu 


Glu 


Gin 


Phe 


Val 


Lys 


Gly 


His 
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aaa 


ace 


age 


aac 
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gtg 


gtg 


ctg 


357 


Gly 


Pro 


Gly 


Gly 


Gin 


Ala 


Thr 


Asn 


Lys 


Thr 


Ser 


Asn 
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Val 


Val 


Leu 
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aar 
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ate 
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ate 
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cag 


aca 


aga 


tea 
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Lys 
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Ser 


Gly 


lie 


Val 


Val 


Lys 
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His 


Gin 


Thr 


Arg 


Ser 
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55 










60 








gtt 
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cag 


aac 


aga 


aag 


eta 


get 


egg 


aaa 


ate 


eta 


eaa 


gag 


aaa 


gta 


453 


Val 


Asp 


Gin 


Asn 


Arg 


Lys 


Leu 


Ala 


Arg 


Lys 


He 


Leu 


Gin 


Glu 


Lys 


Val 






55 
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get: 
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aae 


agt 
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gaa 


aaa 


cga 
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Aaa 


vai 
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Glu 
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Pro 


Val 


His 
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Glu 


Lys 


Arg 




o U 
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y ctct 




gcg 


aag 


aaa 


aaa 


car 


gaa 


agg 


aaa 


aaa 


aga 


gca 


aag 


gaa 


ace 


549 


Glu 


Ala 


Ala 


Lys 


Lys 
100 


Lys 


Gin 


Glu 


Arg 


Lys 
105 


Lys 


Arg 


Ala 


Lys 


Glu 
110 


Thr 




ctg 


gaa 


aaa 


aag 


aas 


ctm 


ett 


aaa 


raa 


ctg 


tgg 


gag 


tea 


agt 


aaa 


aag 


597 


Leu 


Glu 


Lys 


Lys 


Xaa 


Leu 


Leu 


Lys 


Xaa 


Leu 


Trp 


Glu 


Ser 


Ser 


Lys 


Lys 










115 










120 










125 






gtc cac tgagaaaaga attagagatt ccaactgaca gaatctgcca gaagctccca 653
Val His

gggaataatg gtggcgagtt ccatcaccag cattattata gtgcttcaaa agaaatattt 713
ttgatgaact taaaagacaa caaatttatt taaatggtgc actaaactgt agtgaacaga 773
gacatgcacg attcaagaat aaaactcggc cgggcacggt ggacggtgcc tcacatctgt 833
aatcccagca ctttgggagg ccgaggcggg cggatcactt gaggtcagga gtttgagacc 893
agcctggcca acatggtgaa accccgtctc tactaaaaat acaaaaaatt agccaggcat 953
ggtggcgggc acctgtaatc ccagctactc gggaggccga ggcaggagaa ttgcgtgaac 1013
ctgggaggcg gaggttgcag tgagctgaga tcgcgccact gcactcaagc ctgggcaaca 1073
cctgggtgac agagcaagac cccatcycaa aaaaaaaaa 1112



<210> 318
<211> 1623
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 47..586

<221> sig_peptide
<222> 47..124
<223> Von Heijne matrix
score 6.30000019073486
seq GVGLVTLLGLAVG/SY

<221> polyA_signal
<222> 1583..1588

<221> polyA_site
<222> 1614..1623

<400> 318

agggatctgt cggcttgtca ggtggtggag gaaaaggcgc tccgtc atg ggg atc 55
Met Gly Ile
-25

cag acg agc ccc gtc ctg ctg gcc tcc ctg ggg gtg ggg ctg gtc act 103
Gln Thr Ser Pro Val Leu Leu Ala Ser Leu Gly Val Gly Leu Val Thr
-20 -15 -10



ctg ctc ggc ctg gct gtg ggc tcc tac ttg gtt cgg agg tcc cgc cgg 151
Leu Leu Gly Leu Ala Val Gly Ser Tyr Leu Val Arg Arg Ser Arg Arg
-5 1 5

cct cag gtc act ctc ctg gac ccc aat gaa aag tac ctg cta cga ctg 199
Pro Gln Val Thr Leu Leu Asp Pro Asn Glu Lys Tyr Leu Leu Arg Leu
10 15 20 25

cta gac aag acg act gtg agc cac aac acc aag agg ttc cgc ttt gcc 247
Leu Asp Lys Thr Thr Val Ser His Asn Thr Lys Arg Phe Arg Phe Ala
30 35 40

ctg ccc acc gcc cac cac act ctg ggg ctg cct gtg ggc aaa cat atc 295
Leu Pro Thr Ala His His Thr Leu Gly Leu Pro Val Gly Lys His Ile
45 50 55

tac ctc tcc acm mga att gat ggc agc ctg gtc atc agg cca tac act 343
Tyr Leu Ser Thr Arg Ile Asp Gly Ser Leu Val Ile Arg Pro Tyr Thr
60 65 70

cct gtc acc agt gat gag gat caa ggc tat gtg gat ctt gtc mtc aag 391
Pro Val Thr Ser Asp Glu Asp Gln Gly Tyr Val Asp Leu Val Xaa Lys
75 80 85

gtc tac ctg aag ggt gtg cac ccc aaa ttt cct gag gga ggg aar atg 439
Val Tyr Leu Lys Gly Val His Pro Lys Phe Pro Glu Gly Gly Lys Met
90 95 100 105

tct cak tac ctg gat asc ctg aaa gtt ggg gat btg gtg gaa ttt csg 487
Ser Xaa Tyr Leu Asp Xaa Leu Lys Val Gly Asp Xaa Val Glu Phe Xaa
110 115 120

ggg cca agc ggg ttg ctc act tac act gga aaa ggg cat ttt aac att 535
Gly Pro Ser Gly Leu Leu Thr Tyr Thr Gly Lys Gly His Phe Asn Ile
125 130 135

cag ccc aac aag aat ctc cac cag aac ccc gag tgg cga aga aac tgg 583
Gln Pro Asn Lys Asn Leu His Gln Asn Pro Glu Trp Arg Arg Asn Trp
140 145 150

gaa tgattgccgg cgggacagga atcaccccaa tgctacagct gatccgggcc 636
Glu

ptectgaaag teectgaaga tccaaeccag tgetttctgc tttttgeeaa ceagaeagaa 696 
aaggatatca tcttgcggga ggacttagag gaactgcagg cccgctatcc caatcgcttt 756
pagctetggt tcactetgga tcatecccca aaagrttggg cetacagcaa gggctttgtg 816 
^%etgccgacw tgatceggga acacetgcce gctccagggg atgatgtgct ggtactgctt 876 
tgtgggccmc ccccaatggt gcagctggcc tgccatccca acttggacaa actgggctac 936
tcacaaaaga tgcgattcac ctactgagca tcctccagct tccctggtgc tgttcgctgc 996
agttgttccc catcagtact caagcactak aagccttagr ktcctktcct cagagtttca 1056
ggttttttca gttrsatcka gagctgaaat ctggatagta cctgcaggaa caatattcct 1116
gtagccatgg aagagggcca aggctcagtc actccttgga tggcctccta aatctccccg 1176
tggcaacagg tccaggagag gcccatggag cagtctcttc catggagtaa gaaggaaggg 1236
agcatgtacg cttggtccaa gattggctag ttccttgata gcatcttact ctcaccttct 1296
ttgtgtctgt gatgaaagga acagtctgtg caatgggttt tacttaaact tcactgttca 1356
acctatgagc aaatctgtat gtgtgagtat aagttgagca tagcatactt ccagaggtgg 1416
tcttatggag atggcaagaa aggaggaaat gatttcttca gatctcaaag gagtctgaaa 1476
tatcatattt ctgtgtgtgt cdctctcagc ccctgcccad gctagaggga wacagctact 1536
gataatcgaa aactgctgtt tgtgggcarg aacccctggc tgtgcaaata atggggctga 1596
ngccctgtgt gatattgaaa aaaaaaa 1623



<210> 319
<211> 526
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 99..371

<221> sig_peptide
<222> 99..290
<223> Von Heijne matrix
score 3.79999995231628
seq LFIWCVICVTLN/FP

<221> polyA_signal
<222> 491..496

<221> polyA_site
<222> 513..524

<400> 319 

attggattag tagaattgct tttgtcattc cattgttttc atatatttgt ttgggacatt 60 
ttactttttt ctgttaacgc ttaccctagr aattagaa atg aca cca cgt att ctt 116
Met Thr Pro Arg Ile Leu
-60

agc gaa gtc cag ttt tca gca ttt tgt cct tat tgg aca ata gca agg 164
Ser Glu Val Gln Phe Ser Ala Phe Cys Pro Tyr Trp Thr Ile Ala Arg
-55 -50 -45

ata tta gaa cgt gtt ggt tcc gcg tgc ttc cgt ctt gag tta tgt gct 212
Ile Leu Glu Arg Val Gly Ser Ala Cys Phe Arg Leu Glu Leu Cys Ala
-40 -35 -30

gct att gtc gga tat ttt gtc tta gat gta cgt act ttc ctg ttc att 260
Ala Ile Val Gly Tyr Phe Val Leu Asp Val Arg Thr Phe Leu Phe Ile
-25 -20 -15

gtg gta tgt gta att tgc gtt act ttg aat ttt cca cgt ttt tac ttt 308
Val Val Cys Val Ile Cys Val Thr Leu Asn Phe Pro Arg Phe Tyr Phe
-10 -5 1 5

ctt tgt ctc tca tca ctt acc gct ttt ggg acc ccc ccc atc ggg gtt 356
Leu Cys Leu Ser Ser Leu Thr Ala Phe Gly Thr Pro Pro Ile Gly Val
15 20

■fjiP^*^ tararcacac tceettggat ttcctcradt ggggtetget 411 

His Ile Pro Ser Pro



25 

gcggtgaagc tttcccattt tatgtgcaga ttattttcag agggtatata gaattcaggc 471
agctgtttcg ttgtagcaca ttaaaaatat tttcccactt caaaaaaaaa aaacc 526



<210> 320
<211> 989
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 44..814

<221> sig_peptide
<222> 44..112
<223> Von Heijne matrix
score 8.30000019073486
seq VRLLLXLLLLLIA/LE

<221> polyA_site
<222> 978..989

<400> 320 

aaatgtgtac acgcccagct tcctgcctgt tactctccac agt atg cga aga ata 55 

Met Arg Arg Ile
-20

tcc ctg act tct agc cct gtg cgc ctt ctt ttg tdt ctg ctg ttg cta 103
Ser Leu Thr Ser Ser Pro Val Arg Leu Leu Leu Xaa Leu Leu Leu Leu
-15 -10 -5

cta ata gcc ttg gag atc atg gtt ggt ggt cac tct ctt tgc ttc aac 151
Leu Ile Ala Leu Glu Ile Met Val Gly Gly His Ser Leu Cys Phe Asn
1 5 10

ttc act ata aaa tca ttg tcc aga cct gga cag ccc tgg tgt gaa gcg 199
Phe Thr Ile Lys Ser Leu Ser Arg Pro Gly Gln Pro Trp Cys Glu Ala
15 20 25

cat gtc ttc ttg aat aaa aat ctt ttc ctt cag tac aac agt gac aac 247
His Val Phe Leu Asn Lys Asn Leu Phe Leu Gln Tyr Asn Ser Asp Asn
30 35 40 45

aac atg gtc aaa cct ctg ggc ctc ctg ggg aag aag gta tat gcc acc 295
Asn Met Val Lys Pro Leu Gly Leu Leu Gly Lys Lys Val Tyr Ala Thr
50 55 60

agc act tgg gga gaa ttg acc caa acg ctg gga gaa gtg ggg cga gac 343
Ser Thr Trp Gly Glu Leu Thr Gln Thr Leu Gly Glu Val Gly Arg Asp
65 70 75

ctc agg atg ctc ctt tgt gac atc aaa ccc car ata aag acc agt gat 391
Leu Arg Met Leu Leu Cys Asp Ile Lys Pro Gln Ile Lys Thr Ser Asp
80 85 90

cct tcc act ctg caa gtc kar atk ttt tgt caa cgt gaa gca gaa cgg 439
Pro Ser Thr Leu Gln Val Xaa Xaa Phe Cys Gln Arg Glu Ala Glu Arg
95 100 105

tgc act ggt gca tcc tgg cag ttc gcc acc aat gga gag aaa tcc ctc 487
Cys Thr Gly Ala Ser Trp Gln Phe Ala Thr Asn Gly Glu Lys Ser Leu
110 115 120 125

ctc ttt gac gca atg aac atg acc tgg aca gta att aat cat gaa gcc 535
Leu Phe Asp Ala Met Asn Met Thr Trp Thr Val Ile Asn His Glu Ala
130 135 140

agt wag atc aag gag aca tgg aag aaa gac aga ngg ctg gaa aak tat 583
Ser Xaa Ile Lys Glu Thr Trp Lys Lys Asp Arg Xaa Leu Glu Xaa Tyr
145 150 155

ttc agg aag ctc tca aar gga gac tgc gat cac tgg ctc agg gaa ttc 631
Phe Arg Lys Leu Ser Lys Gly Asp Cys Asp His Trp Leu Arg Glu Phe
160 165 170

tta ggg cac tgg gaa gca atg cca raa ccg ama gtg tcm cca rta aat 679
Leu Gly His Trp Glu Ala Met Pro Xaa Pro Xaa Val Ser Pro Xaa Asn
175 180 185

gct tca raw atc cac tgg tct tct tct art cta cca raw ara tgg atc 727
Ala Ser Xaa Ile His Trp Ser Ser Ser Xaa Leu Pro Xaa Xaa Trp Ile
190 195 200 205

atc ctg ggg gca ttc atc ctg tta vtt tta atg gga att gtt ctc atc 775
Ile Leu Gly Ala Phe Ile Leu Leu Xaa Leu Met Gly Ile Val Leu Ile
210 215 220

tgt gtc tgg tgg caa aat ggc ara ara tcc acc tad arg tgataccacg 824
Cys Val Trp Trp Gln Asn Gly Xaa Xaa Ser Thr Xaa Xaa
225 230






gcggcgcaaa attgttcacc tgtggtcctc gatcgctgac agccttggct cccactgctg 884 
tgtgttccct gagtcaagtg gaggcggagc ctgcaatgag cggaratcgc gcctctgcat 944 
tccagtcttg gcaacagarc aagactccgt ctcaaaaaaa aaaaa 989 



<210> 321
<211> 1017
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 3..581

<221> sig_peptide
<222> 3..182
<223> Von Heijne matrix
score 6.69999980926514
seq LWPFLTWINPALS/IC

<221> polyA_site
<222> 1006..1016

<400> 321

ccc agt gtc aag ggg act ctg 47 
Met Cys Pro Ser Leu Glu Glu Ala Pro Ser Val Lys Gly Thr Leu 
-60 -55 -50

ccc tgc tca gga caa cag cag cct ttc ccg ttt gga gcc tca aac atc 95
Pro Cys Ser Gly Gln Gln Gln Pro Phe Pro Phe Gly Ala Ser Asn Ile
-45 -40 -35 -30

cca cta ctc ctg ggc agg agc aga aag gtg gct cga ggt gca ccg gtc 143
Pro Leu Leu Leu Gly Arg Ser Arg Lys Val Ala Arg Gly Ala Pro Val
-25 -20 -15

ctg tgg cca ttt ctc act tgg ata aac cct gca ctg tcc atc tgt gac 191
Leu Trp Pro Phe Leu Thr Trp Ile Asn Pro Ala Leu Ser Ile Cys Asp
-10 -5 1

ccc tta gga tcc tgc gga tgg cyw tgc cac acg gcc car gtc cct gcg 239
Pro Leu Gly Ser Cys Gly Trp Xaa Cys His Thr Ala Gln Val Pro Ala
5 10 15

ccc ctg car ttg cct act gcc tgt cct ccc ctc cca cat ggc acc cgg 287
Pro Leu Gln Leu Pro Thr Ala Cys Pro Pro Leu Pro His Gly Thr Arg
20 25 30 35

gct gta ggc ccc acg cca ggc ctc ctc cct gag gct gca gcc cca sgc 335
Ala Val Gly Pro Thr Pro Gly Leu Leu Pro Glu Ala Ala Ala Pro Xaa
40 45 50

acg tgk ggg gca ctg tcc tca cgc agc agg cac tgg tca tgt tcc att 383
Thr Xaa Gly Ala Leu Ser Ser Arg Ser Arg His Trp Ser Cys Ser Ile
55 60 65

gtc arc tgc ctc cac ctg cac ara ctc ctg tct gtg gag acc aga arc 431
Val Xaa Cys Leu His Leu His Xaa Leu Leu Ser Val Glu Thr Arg Xaa
70 75 80

ttc cas aaa cat ctg ttg gtg ctg ctg gtg gct gtg gcc cat agt gtt 479
Phe Xaa Lys His Leu Leu Val Leu Leu Val Ala Val Ala His Ser Val
85 90 95

ctg gaa cca cct gcc ctg gtc cca aat gtg cag tgt gag atg tgc aca 527
Leu Glu Pro Pro Ala Leu Val Pro Asn Val Gln Cys Glu Met Cys Thr
100 105 110 115

cac tca ggg ccc cgt gac ctg gaa gcc gca gtc gtg tcc cca gca cct 575
His Ser Gly Pro Arg Asp Leu Glu Ala Ala Val Val Ser Pro Ala Pro

120 125 130 

tgg gaa tgagcctgtc ctctgtgtga aggagggggt ggttctcaaa ccactgactc 631 
Trp Glu 

ttggtgctca ggaggggcct gctgctgtcc tgggcatggg gtggtcattg ttcaagactg 691 

aggcagactc agtctttgaa agggtgcaga ggccaggcgc ggtggctcac gcctgtaatt 751 

ccagcacttt gggaggccaa ggtggacaga tcatgaggtc aggagttcga gaccagcctg 811 

gccaatacgg tgaaaccgca tctctactaa rraatawcaw aaattagtcg ggcatgggtg 871 

atgtgtgctt gtagtcccag ctactcatga ggyctgaggc agaagaatca cctgaatctg 931 

ggaggcagag gttgcagtga accaagatcg cacgactgta caccagcctg ggcgacagag 991 

tgagactccg tctcaaaaaa aaaaam 1017 



<210> 322
<211> 529
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 107..427

<221> sig_peptide
<222> 107..190
<223> Von Heijne matrix
score 3.79999995231628
seq RFLSLSAADGSDG/SH

<221> polyA_signal
<222> 499..504

<221> polyA_site
<222> 516..529

<400> 322

aaagtcagcg ctggagtcgg ctaggcggct ggaaacggcg gctgccgccg gtgactcagg 60 
gaggcgggag gccgmsggmg gagctcttcc tgcaggcgtg garacc atg gtg ctc 115
Met Val Leu

acg ctc gga gaa agt tgg ccg gta ttg gtg ggg agg agg ttt ctc agt 163
Thr Leu Gly Glu Ser Trp Pro Val Leu Val Gly Arg Arg Phe Leu Ser
-25 -20 -15 -10

ctg tcc gca gcc gac ggc agc gat ggc agc cac gac agc tgg gac gtg 211
Leu Ser Ala Ala Asp Gly Ser Asp Gly Ser His Asp Ser Trp Asp Val
-5 1 5

gag cgc gtc gcc gag tgg ccc tgg ctc tcc ggg acc att cga gct gtt 259
Glu Arg Val Ala Glu Trp Pro Trp Leu Ser Gly Thr Ile Arg Ala Val
10 15 20

tcc cac acc gac gtt acc aag aag gat ctg aag gtg tgt gtg gaa ttt 307
Ser His Thr Asp Val Thr Lys Lys Asp Leu Lys Val Cys Val Glu Phe
25 30 35

gak ggg gaa tct tgg agg aaa aga aga tgg ata gaa gtc tac agc ctt 355
Xaa Gly Glu Ser Trp Arg Lys Arg Arg Trp Ile Glu Val Tyr Ser Leu
40 45 50 55

cta agg aaa gca ttt tta gta aaa cat aat ttg gtt tta gct gaa cga 403
Leu Arg Lys Ala Phe Leu Val Lys His Asn Leu Val Leu Ala Glu Arg
60 65 70

aag tca cct gaa att tct tgg ggt taaccatctt tagttaaatg gaattttaat 457
Lys Ser Pro Glu Ile Ser Trp Gly
75

ttaaatgacg ctttgctaat tttaagtgtt aagcattttg cattaaaata ttcatataat 517
aaaaaaaaaa aa 529



<210> 323
<211> 1046
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 45..407

<221> sig_peptide
<222> 45..83
<223> Von Heijne matrix
score 5.69999980926514
seq MLVLRSALTRALA/SR

<221> polyA_signal
<222> 1008..1013

<221> polyA_site
<222> 1032..1042

<400> 323

^;|^aaaggacac ggctggctgc ttttctcagc gccgaagccg cgcc atg etc gtc etc 56 
Met Leu Val Leu
-10

aga agc gcc ctg act cgg gcg ctg gcc tca cgg acg ctg gcg cct cag 104
Arg Ser Ala Leu Thr Arg Ala Leu Ala Ser Arg Thr Leu Ala Pro Gln
-5 1 5

atg tgc tca tct ttt gct acg gga ccc aga caa tac gat gga ata ttc 152
Met Cys Ser Ser Phe Ala Thr Gly Pro Arg Gln Tyr Asp Gly Ile Phe
10 15 20

tat gaa ttt cgt tct tat tac ctt aag ccc tca aag atg aat gag ttc 200
Tyr Glu Phe Arg Ser Tyr Tyr Leu Lys Pro Ser Lys Met Asn Glu Phe
25 30 35

ctg gaa aat ttt gag aaa aac gct caa ctt cgg aca gct cac tct gaa 248
Leu Glu Asn Phe Glu Lys Asn Ala Gln Leu Arg Thr Ala His Ser Glu
40 45 50 55

ttg gtt gga tac tgg agt gta kaa ttt gga ggc aga atg awt aca gtg 296
Leu Val Gly Tyr Trp Ser Val Xaa Phe Gly Gly Arg Met Xaa Thr Val
60 65 70

ttt cat att tgg aag tat gat aat ttt gct cat cga act gaa ttt cag 344
Phe His Ile Trp Lys Tyr Asp Asn Phe Ala His Arg Thr Glu Phe Gln
75 80 85

aaa gcc ttg gcc aaa gat aag gaa tgg caa gaa caa ttc ctc att cca 392
Lys Ala Leu Ala Lys Asp Lys Glu Trp Gln Glu Gln Phe Leu Ile Pro
90 95 100

aat ttg gct ctc aat tgataaacaa gatagtgaga ttacttatct ggtaccatgg 447
Asn Leu Ala Leu Asn
105

tgcaaattag aaaaacctcc aaaagaagga gtctatgaac tggccacttt tcagatgaaa 507 

cctggtgggc cagctctgtg gggtgatgca tttaaaaggg cagttcatgc tcatgtcaat 567 

ctaggctaca caaaactagt tggagtgttc cacacagagt acggagcact caacagagtt 627 

catgttcttt ggtggaatga gagtgcagat agtcgtgcag ctgggagaca taagtcccat 687 

gaggatccca gagttgtggc agctgttcgg gaaagtgtca actacctagt atctcagcag 747 

aatatgcttc tgattcctac atcgttttca ccactgaaat agttttctac tgaaatacaa 807 

aacatttcat taactgctat aggatctgtc tgctaatggt gcttaaattc tcccaagagg 867 

ttctcacttt tatttgaagg aggtggtaag ttaatttgct atgtttcttg cattatgaag 927 

gctacatctg tgctttgtaa gtaccacttc aaaaaatakt tctgtttact ttctgcatgg 987 

tatttcagtg tctgtcatac attaaaaata cttgtcactg tttyaaaaaa aaaaammcc 1046 



<210> 324
<211> 880
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 201..332

<221> sig_peptide
<222> 201..251
<223> Von Heijne matrix
score 7.80000019073486
seq VLWLISFFTFTDG/HG

<221> polyA_site
<222> 869..880

<400> 324

:|^pattgctgat ggatcagtga gcctgtgttc atgccagtga gctgctgtgg ctcagatact 60 

'^atactttct ttccaaacag cataagaagt gattgancca caagtatact gaaggmargg 120 

^l^^hcccwsvar tyctggwgtg amgagataaa tcaccagtca cagactatgc acccgactgc 180 

cgctgttcag tccagggaaa atg aaa gtt gga gtg ctg tgg ctc att tct ttc 233
Met Lys Val Gly Val Leu Trp Leu Ile Ser Phe
-15 -10

ttc acc ttc act gac ggc cac ggt ggc ttc ctg ggg gtg agt tgg tgc 281
Phe Thr Phe Thr Asp Gly His Gly Gly Phe Leu Gly Val Ser Trp Cys
-5 1 5 10

tat gtc tca tat ctc ttc tca act aac tct cct ctc tcg ttc cgg cgc 329
Tyr Val Ser Tyr Leu Phe Ser Thr Asn Ser Pro Leu Ser Phe Arg Arg
15 20 25

att tagaacccct cactctctag gggactgcaa ctgcataatt taatgtactt 382
Ile

gagatcagaa gtcctgagtt ctcgtttcaa cattaccaac attcactgtg tggccttgga 442
taagtragtc atttcatctc ttcggagctt agatgatcma actgcaarag gaggatcttt 502
gattamacta tcttagagat cttttccagt tcaacacatg ctgtactatg gcttctcgga 562
tgcagaaaaa tcacatggat ggacattagc aatccttara cactgtcttt cctgtctaca 622
ctcgcttgag tgatgckttc atctaggatc atggttttaa tattctctac atgctgatga 682
ctcccagctg tatagctcca tctcagaacc tctcccctgt ccacactcac atatccatta 742
cctacgtgtt atttccagct gggaaatcca gcggaacctc ggnaacttca tttgnttcaa 802
aatcgnaacc caatccttct tgcctatctc agcaagtggt atcactatct ttccagctac 862

ttaggcaaaa aaaaaaaa 880



<210> 325
<211> 1217
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 217..543

<221> sig_peptide
<222> 217..255
<223> Von Heijne matrix
score 6.40000009536743
seq MCLLTALVTQVIS/LR

<221> polyA_site
<222> 1206..1217

<400> 325

'^^Jl^^tgccagtg tcagcttctc tccgaaaact gggtaatacg aaatggtctt tattggttgt 60 
Jj^^aacactcga gctgagaaac attttaggat ctttgtgtct tttgtgatga ttttgtttct 120 
^^%raagrwgga aasctgtcta aaaatattca agtgtgcaac caaggattta gatgaagcca 180 
gcaaacaaag gaatcatgta atcaggacct gagcga atg tgc tta ctc acg gcg 234
Met Cys Leu Leu Thr Ala
-10

cta gtt aca cag gtg att tcc tta aga aaa aat gca gag aga act tgt 282
Leu Val Thr Gln Val Ile Ser Leu Arg Lys Asn Ala Glu Arg Thr Cys
-5 1 5

tta tgc aag agg aga tgg ccc tgg ngc ccc tcg ccc cgg atc tac tgc 330
Leu Cys Lys Arg Arg Trp Pro Trp Xaa Pro Ser Pro Arg Ile Tyr Cys
20 25

tca tcc acc cca tgc gat tcc aaa ttc ccc acc gtc tac tcc agt gcc 378
Ser Ser Thr Pro Cys Asp Ser Lys Phe Pro Thr Val Tyr Ser Ser Ala
35 40

cca ttc cat gcc ccc ctc ccc gtc cag aat tcc tta tgg ggg cac ccg 426
Pro Phe His Ala Pro Leu Pro Val Gln Asn Ser Leu Trp Gly His Pro
45 50 55

ctc cat ggt tgt tcc tgg caa tgc cac cat ccc cag gga car aat ctc 474
Leu His Gly Cys Ser Trp Gln Cys His His Pro Gln Gly Gln Asn Leu
60 65 70

cag cct gcc agt ctc cad acc cat ctc tcc aag ccc aag cgc cat ttt 522
Gln Pro Ala Ser Leu Xaa Thr His Leu Ser Lys Pro Lys Arg His Phe
75 80 85

ara aar aar rra tgt caa gcc tgatgaarac atgagtggca aaaacattgc 573
Xaa Lys Lys Xaa Cys Gln Ala
90 95

aatgtacara aatgagggtt tctatgctga tccttacctt tatcacgagg gacggatgag 633
catascctca tcccatggtg gacacccact ggatgtcccc gaccacatca ttgcatatca 693
ccgcaccgcc atccggtcag cgagtgctta ttgtaacccc tcaatgcaag cggaaatgca 753
tatggaacaa tcactgtaca gacagaaatc aaggaaatat ccggatagcc atttgcctac 813
actgggctcc aaaacacccc ctgcctctcc tcacagaktc agtgacctga ggatgataga 873
catgcacgct cactataatg cccacggccc ccctcacacc atgcagccag accgggcctc 933
tccgagccgc caggccttta aaaaggagcc aggcaccttg gtgtatatag aaaagccacg 993
gagcgctgca ggattatcca gccttgtaga cctcggccct cctctaatgg agaagcaagt 1053
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ttttgcctac agcacggcga caatacccaa agacagagag accagagaga ggatgcaagc 1113 
catggagaaa cagattgcca gtttaactgg ccttgttcag tctgcgcttt ttaaagggcc 1173 
cattacaagt tatagcaaar atgcgtctag ctaaaaaaaa aaaa 1217 



<210> 326
<211> 959
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 18..446

<221> sig_peptide
<222> 18..140
<223> Von Heijne matrix
score 4.09999990463257
seq GILILWIIRLLFS/KT

<221> polyA_signal
<222> 930..935

<221> polyA_site
<222> 948..959

<400> 326

aaaggaagcg gctaact atg gcg acc gcc acg gag cag tgg gtt ctg gtg 50
Met Ala Thr Ala Thr Glu Gln Trp Val Leu Val
-40 -35

gag atg gta cag gcg ctt tac gag gct cct gct tac cat ctt att ttg 98
Glu Met Val Gln Ala Leu Tyr Glu Ala Pro Ala Tyr His Leu Ile Leu
-30 -25 -20 -15

gaa ggg att ctg atc ctc tgg ata atc aga ctt ctt ttc tct aag act 146
Glu Gly Ile Leu Ile Leu Trp Ile Ile Arg Leu Leu Phe Ser Lys Thr
-10 -5 1

tac aaa tta caa gaa cga tct gat ctt aca gtc aag gaa aaa gaa gaa 194
Tyr Lys Leu Gln Glu Arg Ser Asp Leu Thr Val Lys Glu Lys Glu Glu
5 10 15

ctg att gaa gag tgg caa cca gaa cct ctt gtt cct cct gtc cca aaa 242
Leu Ile Glu Glu Trp Gln Pro Glu Pro Leu Val Pro Pro Val Pro Lys
20 25 30

gac cat cct gct ctc aac tac aac atc gtt tca ggc cct cca agc cac 290
Asp His Pro Ala Leu Asn Tyr Asn Ile Val Ser Gly Pro Pro Ser His
35 40 45 50

aaa act gtg gtg aat gga aaa gaa tgt ata aac ttc gcc tca ttt aat 338
Lys Thr Val Val Asn Gly Lys Glu Cys Ile Asn Phe Ala Ser Phe Asn
55 60 65

ttt ctt gga ttg ttg gat aac cct agg gtt aag gca gca gct tta gca 386
Phe Leu Gly Leu Leu Asp Asn Pro Arg Val Lys Ala Ala Ala Leu Ala
70 75 80

tct cta aag aag tat ggc gtg ggg act tgt gga ccc tgt gga ttt tat 434
Ser Leu Lys Lys Tyr Gly Val Gly Thr Cys Gly Pro Cys Gly Phe Tyr
85 90 95

ggc aca ttt gaa tgaaratgaa ggatcattga tttccttgtg tatggataat 486
Gly Thr Phe Glu
100

ccgggaacag gccaactaaa tatttgatga atgtatgatt tcaaatacag tgaattccct 546 

gggagtcatc aaaraagacg gcattttatg gttgttttta ttaagtgtat attctttgct 606 

cctgaaaatg ttattaaata attgtttagg ccgggcatgg tggctcatgc ctgtaatccc 666 

agcactttca aaggctgagg caggcagatc acctgaggtc aggagttcaa aaccagcctg 726 

gccaacatgc tgaaacctcg tctctactaa aaatacaaaa attagctggg cgtggtggtg 786 

grtgcctgtg gtcccagctr cgtgggaggc tgaggtggga gaattgcttc aacctgggag 846 

gcggaggttg cagtgagccg agatcatgcc actgcactcc agcctgggca acagagcaag 906 

actgtctcaa aaataaataa ataaataaaa ttgtttaaat gaaaaaaaaa aaa 959 



<210> 327
<211> 921
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 29..724

<221> sig_peptide
<222> 29..118
<223> Von Heijne matrix
score 3.90000009536743
seq VAHALSLPAESYG/NX

<221> polyA_signal
<222> 886..891

<221> polyA_site
<222> 910..920

<400> 327

[^'^aggagccac gctttcgggg gttgcaag atg gcg gcc acc agt gga act gat 52 

Met Ala Ala Thr Ser Gly Thr Asp 

-25

gag ccg gtt tcc ggg gag ttg gtg tct gtg gca cat gcg ctt tct ctc 100
Glu Pro Val Ser Gly Glu Leu Val Ser Val Ala His Ala Leu Ser Leu
-20 -15 -10

cca gca gag tcg tat ggy aac grt yct gac att gag atg gct tgg gcc 148
Pro Ala Glu Ser Tyr Gly Asn Xaa Xaa Asp Ile Glu Met Ala Trp Ala
-5 1 5 10

atg aga gca atg cag cat gct gaa gtc tat tac aag ctg att tca tca 196
Met Arg Ala Met Gln His Ala Glu Val Tyr Tyr Lys Leu Ile Ser Ser
15 20 25

gtt gac cca cag ttc ctg aaa ctc acc aaa gta gat gac caa att tac 244
Val Asp Pro Gln Phe Leu Lys Leu Thr Lys Val Asp Asp Gln Ile Tyr
30 35 40

tct gag ttc cgg aaa aat ttt gag acc ctt agg ata gat gtg ttg grc 292
Ser Glu Phe Arg Lys Asn Phe Glu Thr Leu Arg Ile Asp Val Leu Xaa
45 50 55

cca gaa gan ctc aag tca gaa tca gcn aaa gag ccc cca gga tac aat 340
Pro Glu Xaa Leu Lys Ser Glu Ser Ala Lys Glu Pro Pro Gly Tyr Asn
60 65 70

tct ttg cca ttg aaa ttg ctc gga acc ggg aag gct ata aca aag ctg 388
Ser Leu Pro Leu Lys Leu Leu Gly Thr Gly Lys Ala Ile Thr Lys Leu
75 80 85 90

ttt ata tca gtg ttc agg aca aag aag gag aga aag gag tca aca atg 436
Phe Ile Ser Val Phe Arg Thr Lys Lys Glu Arg Lys Glu Ser Thr Met
95 100 105

gag gag aaa aaa gag ctg aca gtg gag aag aag aga aca cca aga atg 484
Glu Glu Lys Lys Glu Leu Thr Val Glu Lys Lys Arg Thr Pro Arg Met
110 115 120

gag gag aga aag gag ctg ata gtg gag aag aaa aag agg aag gaa tca 532
Glu Glu Arg Lys Glu Leu Ile Val Glu Lys Lys Lys Arg Lys Glu Ser
125 130 135

aca gag aag aca aaa ctg aca aag gag gag aaa aag gga aag aag ctg 580
Thr Glu Lys Thr Lys Leu Thr Lys Glu Glu Lys Lys Gly Lys Lys Leu
140 145 150

aca aag aaa tca aca aaa gtg gtg aaa aag cta tgt aag gta tac agg 628
Thr Lys Lys Ser Thr Lys Val Val Lys Lys Leu Cys Lys Val Tyr Arg
155 160 165 170

gaa cag cac tct aga agc tat gac tca att gag act aca agt acc acg 676
Glu Gln His Ser Arg Ser Tyr Asp Ser Ile Glu Thr Thr Ser Thr Thr
175 180 185

gtg cta ctt gca cag acc cct ttg gtt aaa tgt aaa ttc ttg tac aat 724
Val Leu Leu Ala Gln Thr Pro Leu Val Lys Cys Lys Phe Leu Tyr Asn
190 195 200

tgaaggatac gcagaaggac atctttctag tctaacagtc aggagctgct ctggtcattc 784
bcttgtatga actggtctaa agactgttag tggggtgtta gttgattttt cctggtatac 844
tgtttcttgg ctgacactac tggtcaagta agaaatttgt aaataaattt cttttggttc 904
^%tattaamaa aaaaaas qon 



<210> 328
<211> 1344
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 404..586

<221> sig_peptide
<222> 404..466
<223> Von Heijne matrix
score 4.09999990463257
seq SLMFFSMMATCTS/NV

<221> polyA_signal
<222> 1304..1309

<221> polyA_site
<222> 1334..1344

<400> 328 

ataatttaat gcaaaatatc cttttatgaa tttcatgtta atattgtgaa atattaaaat 60
aattccacaa tagttgagaa aaatgagcat ttttttccat ttttaaaaaa tgcatagaaa 120
agacaatttt aaaatcctgg gamccawatt tatttagaag tagctgttag taaaacatta 180
gaaaaggagt caggccatba ggttatttat nbnaatctct aagcaattag gntgaagtta 240
ttaagtcaag cctagaaaag ctgcctcctt gtaaggcttt catgacaatg tatagtaatc 300
brcagtgtcc aattcttcgc actcctcagg aatatcacta cctcaggtta cggtacacag 360
gctataattg atgatgatgt tcagataact gaagacacaa taa atg aca ttc aga 415
Met Thr Phe Arg
-20

cat cag gac aat tcc ctc atg ttc ttt tct atg atg gcc acc tgt acc 463
His Gln Asp Asn Ser Leu Met Phe Phe Ser Met Met Ala Thr Cys Thr
-15 -10 -5

agc aac gtg ggt ttc acc cac aca acg atg aac tgt tct ctt act tct 511
Ser Asn Val Gly Phe Thr His Thr Thr Met Asn Cys Ser Leu Thr Ser
1 5 10 15

cca gtt gat ttt aaa gac ttg tta aga gtc tta cta ata aaa ttt ggg 559
Pro Val Asp Phe Lys Asp Leu Leu Arg Val Leu Leu Ile Lys Phe Gly
20 25 30

tat gat aga aaa tcc aca atc aaa tct tgaaccaaat aacatattaa 606
Tyr Asp Arg Lys Ser Thr Ile Lys Ser
35 40



















attactaata tttaagtgat ggaagacaea caaaaaaett aaaagcaega acaacetaac 666 

ttgaaaaara attttaaaat atgattaaee tgaaraaaar araateetaa ragceaaage 726 

teetttttat ttagcttgga attttectat tggttcctaa caaactgtec eaatgteata 786 

taaggaaaca tgatetatta cattccttta taacaaegtg gararactat aaaeetatgt 846 

aagtagtaaa actatatcag adactcagga ractgactww aaggectgga tctgcagtgt 906 

attatctgta taaaaattgg cagggggaag etaaaaggaa aggagattgg agatctcaat 966 

^.,|tctatcatgg tgtattteat aegcaaatea rageatgeat tgttttttgt ttttggaaar 1026 

ll'^IJpLvaarggaag tgtgttetgc eccatgtttc etteegtgtt tatagtteaa aetctatata 1086 

J;!*jtacttcaggt attttttgtt tageecttca ttataaatgg geaggaaatt gtttatcaac 1146 

[|'ctagecagtt tattactagt gaecttgact teagtatctt gageattett ttatattttt 12 06 

r|;bttttattat ectgagtctg taaetaaaea attttgtett caaattttta tccaatatec 1266 

y kttgeaccac accaaatcaa gcttettgat ttteaaaaat aaaaaggggg aaatacttae 1326 

==|i^aacttgtaaa aaaaaaaa 1344 



<210> 329
<211> 585
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 331..432

<221> sig_peptide
<222> 331..387
<223> Von Heijne matrix
score 7
seq AGLSSCLLPLCWL/ER

<221> polyA_signal
<222> 548..553

<221> polyA_site
<222> 573..585

<400> 329

aagcctaggt gtggcgcccc gaccggactt tcacttctgg ccagcccttt ccccacctgg 60
gcgcgggass ggtgccagtc tttaaacaac ctctcgatgg gtcccacgaa gatgtttcca 120
gacccttgga atgccaagtt caagtttagc tatgtctcgc ggagaggccg gtggaagaag 180
caacgagaat gaagcacccc agttctctgc tgagcacatg ggcatctgca ataaagattt 240
aatttcccag cttctcctga agctcggtat ggccacaaca ctaaattctg cccgaggaga 300
ttgagcaaaa tagtatggga cttccaagaa atg ttt tta aag tca ggg gca ggc 354
                                 Met Phe Leu Lys Ser Gly Ala Gly
                                                 -15

ctt tct tca tgc ctt ctt cct ctt tgc tgg ctg gaa cgc aaa gac cat 402
Leu Ser Ser Cys Leu Leu Pro Leu Cys Trp Leu Glu Arg Lys Asp His
    -10                 -5                  1               5

ggc agg agg cca agc asc cat cct gga agg tgaaagcctc atactaagga 452
Gly Arg Arg Pro Ser Xaa His Pro Gly Arg
                10                  15

cgtcaracag cgaaataara rcctgggtcc ttgaccctgt aaasatctcc ctccccatcc 512
tggtctgtct gccttgactc ctttcatatg aaaaaaataa acttttaact tgcgtwaacc 572
aaaaaaaaaa aaa 585



<210> 330 
<211> 914 
<212> DNA 
<213> Homo sapiens 

<221> CDS
<222> 59..703

<221> sig_peptide
<222> 59..220
<223> Von Heijne matrix
score 5.09999990463257
seq FLLSQMSQHQVHA/VQ

<221> polyA_signal
<222> 886..891

<221> polyA_site
<222> 903..914

<400> 330

acaaatatca atgatgttta tgaatctagt gtgaaagtkt taatcacatc acaaggct 58 



atg aac rra tat gca agt cca ttc aac tgw caa ttg ard tat ttg gak 106
Met Asn Xaa Tyr Ala Ser Pro Phe Asn Xaa Gln Leu Xaa Tyr Leu Xaa
                -50                 -45                 -40

ttg agc agr ttc gag tgt gtr cat aga gat gga aga gta att aca ctg 154
Leu Ser Arg Phe Glu Cys Val His Arg Asp Gly Arg Val Ile Thr Leu
            -35                 -30                 -25

tct tat cag gag cag gag cta cag gat ttt ctt ctg tct cag atg tca 202
Ser Tyr Gln Glu Gln Glu Leu Gln Asp Phe Leu Leu Ser Gln Met Ser
        -20                 -15                 -10

cag cac cag gta cat gca gtt cag caa ctc gcc aag gtt atg ggc tgg 250
Gln His Gln Val His Ala Val Gln Gln Leu Ala Lys Val Met Gly Trp
    -5                  1               5                   10

caa gta ctg agc ttc agt aat cat gtg gga ctt gga cct ata gag agc 298
Gln Val Leu Ser Phe Ser Asn His Val Gly Leu Gly Pro Ile Glu Ser
                15                  20                  25

abt ggt aat gca tct gcc atc acg gtg gcc ccc caa gtg gtg act atg 346
Xaa Gly Asn Ala Ser Ala Ile Thr Val Ala Pro Gln Val Val Thr Met
            30                  35                  40
V— Uct


u u u 


cag 


I-t-C 


gta 


acg 


gac 


ctg 


aaa 


gtg 


gea 


gca 


3ga 


tta 


tgg 


ttc 


394 


J-IC u. 




m n 


irlie 


va± 


iYieu 


Asp 


Leu 

o U 


T t m 

Lys 


Val 


Ala 


Ala 


Arg 
55 


Leu 


Trp 


Phe 






ttc 


etc 


y L.CI 


aoc 


aat 


gta 


aar 


ace 


ccc 


caa 


aaa 


gtg 


3tg 


ttt 


tac 


442 


Ser 


Phe 


Leu 


Val 


Thr 


Asn 


Val 


Lys 


Thr 


Phe 


Gin 


Lys 


Val 


Met 


Phe 


Tyr 
















O D 










70 












at3 




-a 'a 4- 




gtc 


a -h 

auc 


cue 


9tg 




eat 


tea 


aar 


aag 


ttc 


agt 


490 


Lys 


He 


Thr 


Asn 


Gly 


Val 


He 


Phe 


Val 


Gly 


His 


Ser 


Lys 


Lys 


Phe 


Ser 




f D 




















85 










90 






a -t- a 


aaa 


tgg 


33g 


gtc 


Jcaa 


att 


ttg 


ttt 


ata 


aaa 


tgg 


arm 


tgc 


tta 


538 




X±e 


Lys 


Trp 


Lys 


Val 


Xaa 


He 


Leu 


Phe 
100 


He 


Lys 


Trp 


Xaa 


Cys 
105 


Leu 






c tg 


cac 


CC3 


gcc 


ctt 


gtc 


tac 


tat 


gat 


ttt 


ttc 


car 


atg 


ttt 


cct 


586 


y o 


Leu 


rlis 


Leu 


A±3 


Leu 


va± 


Tyr 


Tyr 
115 


Asp 


Phe 


Phe 


Gin 


Met 
120 


Phe 


Pro 




333 


3r33 


rr-h 1- 

guu 


tec 


ara 


33C 


ttt 


gac 


ttg 


33a 


tgt 


ttg 


car 


ate 


aac 


tat 


634 


Lys 


Xaa 


Val 


Ser 


Xaa 


Asn 


Phe 


Asp 


Leu 


Lys 


Cys 


Leu 


Gin 


He 


Asn 


Tyr 








125 










130 










135 








aag cac aaa gaa gar ata act tcc aaa aga gtg ctg ttt tta aaa ata 682
Lys His Lys Glu Glu Ile Thr Ser Lys Arg Val Leu Phe Leu Lys Ile
    140                 145                 150












att 


agg 


aaa 


tgt 


ttt 


att 


tagcaetttc aaacttttca etttata3at 


733 


fie 
^155 


He 


Arg 


Lys 


Cys 


Phe 
160 


He 























acaagtgct ttgaaatgca gaagtttatg tacagttgta tatacagtat gacaagatgt 793
aaataatat gtttttcatg cagtttaaaa tattactaac ttaagggttt ctatgtgctt 853
tttaaaatat tccttctttg atgttgacat caaataaagt atgtggttta aaaaaaaaaa 913

914 



<210> 331
<211> 1161
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 672..752

<221> sig_peptide
<222> 672..722
<223> Von Heijne matrix
score 4.30000019073486
seq LLYAHLSFTSKRA/VV

<221> polyA_site
<222> 1150..1161

<400> 331

aagatatcac tgtcttgttt tcacttagat cctacttaca aagtgagggt tattaacaga 60
ataaagcctt cctttaaagc tttataataa tcatatttat taataatgct gttgtgcata 120
cttatagtat gcatatattc agcatatgtt gcatgtsttc agaattacat aagatgaaat 180
ccctttcatt gcaacttgca agtgagaaaa gatccttagt ggctctggtg gaagaaatag 240
tatttcttct tctcagggtg tctccctgcc ttggcccctc ccagaagccc cggctttaaa 300
agtgaaaatg tttgaaacat gaaacatgtc tgtaggaagc atcagcatgg ccataagtgc 360
artgattttc atatatgcct ctgcccattt caaatatatt tttgacatga ataaatctaa 420
cagtatacar aataattcat gtaaraccct aacgtgtaca tgtgaaaaag catttctata 480
taatgtgagg agcactggcc atcaattagg gaaataaagg tcatgtaata ttgcaaattt 540
tcaaaataga gcsstgcaag ataactgcaa tcataccaaa aactatttga gtaaatggat 600
ttttaaagta atttttgttt aaaaaaattt atatttcaga agsagaaaat gtcaaatgat 660
agtctttgta a atg gtg gtg cac ctt ctc tat gca cat ctg tct ttt aca 710
             Met Val Val His Leu Leu Tyr Ala His Leu Ser Phe Thr
                     -15                 -10                 -5

tca aaa aga gct gtg gtc atg cta aaa tta gag ata act ttt 752
Ser Lys Arg Ala Val Val Met Leu Lys Leu Glu Ile Thr Phe
                1               5                   10

tgaatgactt ggtcaagctg tgtgtaaaat atttaaccat aagtcaagta cagtgtacta 812 

tgtttaataa agttacattt aatgcattta ttgcatatat gaatatatac atgaagaggc 872 

tttatgtctt ctggtatttg attttgaatg ttttttaagt cagtggtgcc tttaggcaag 932 

aactttcgaa attaatcatt ctttgtgttt tctgattttt caggtaacat gtacactatt 992 

tagaaaccat catagtttat tcaccttaaa aaattgattg tattatttaa atatatcact 1052

tagatgggca tttcctataa ttaggatatt ccaaatagtt gctgaaatca attgtgccat 1112 

tgaccaatgg atgcacttgg ttagccttaa ttttttyaaa aaaaaaaaa 1161 



<210> 332
<211> 363
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 57..311

<221> sig_peptide
<222> 57..128
<223> Von Heijne matrix
score 5.30000019073486
seq LFHLLFLPHYIET/FK

<221> polyA_signal
<222> 332..337

<221> polyA_site
<222> 351..363

<400> 332

acatttctta ctgccttacg ctcatcctga ggtccacctt ggtctctaaa aacacc atg 59
                                                              Met

tgt tct cat gcc tcc atg tct ttt cac aca ctg ttc cat ttg ctc ttc 107
Cys Ser His Ala Ser Met Ser Phe His Thr Leu Phe His Leu Leu Phe
            -20                 -15                 -10

ctc cca cat tac att gaa act ttc aag cct cag tcg aaa cat tgc ttc 155
Leu Pro His Tyr Ile Glu Thr Phe Lys Pro Gln Ser Lys His Cys Phe
        -5                  1               5

ttc tgg ata gca gcc ttc ttg aca tcc ctc ctc act ccc cag tcc cta 203
Phe Trp Ile Ala Ala Phe Leu Thr Ser Leu Leu Thr Pro Gln Ser Leu
10                  15                  20                  25

cag ggc ttc cat agc tct tta tgt gca ctt cga tcc cag cat ttt cca 251
Gln Gly Phe His Ser Ser Leu Cys Ala Leu Arg Ser Gln His Phe Pro
                30                  35                  40

tcg act tgt aat tgt ttc tgc tac ctg aca atc atc gcc ttg drd tac 299
Ser Thr Cys Asn Cys Phe Cys Tyr Leu Thr Ile Ile Ala Leu Xaa Tyr
            45                  50                  55

tgg gac aac ctt tgattactca ttatatcctc aataaatatt tgttgaacca 351
Trp Asp Asn Leu
        60

aaaaaaaaaa aa 363



<210> 333
<211> 645
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..232

<221> sig_peptide
<222> 80..127
<223> Von Heijne matrix
score 3.70000004768372
seq IALTLIPSMLSRA/AG

<221> polyA_signal
<222> 617..622

<221> polyA_site
<222> 634..645

<400> 333

accttcttgt tatttatgct attctctttg tggctccatt cttctttcaa tcttctcagc 60
ttataaccgt ctttccctt atg cta agg ata gcc ctt aca ctc atc cca tct 112
                     Met Leu Arg Ile Ala Leu Thr Leu Ile Pro Ser
                                             -10

atg ctg tca agg gct gct ggt tgg tgc tgg tac aag gag ccc act cag 160
Met Leu Ser Arg Ala Ala Gly Trp Cys Trp Tyr Lys Glu Pro Thr Gln
-5                  1               5                   10

cag ttt tct tac ctt tgc ctg ccc tgc ctt tca tgg aat aar aaa ggc 208
Gln Phe Ser Tyr Leu Cys Leu Pro Cys Leu Ser Trp Asn Lys Lys Gly
            15                  20                  25

aac gtt ttg cag ctt cca aat ttc tgaaraaact aatctcarat tggcagttaa 262
Asn Val Leu Gln Leu Pro Asn Phe
        30                  35

agtcaaaatg ttgccaaata tttattcctt ttgcctaakt ttggctaccc ggttcaattg 322
ctttttattt ttaatgtctt gactcttcar agttcgtacc tcaaaaraac aatgaraaca 382
tttgctttgc tttctgctga atccctaatc tcaacaatct atacctggac tgtccagttc 442
tcctcctgtg ctatcttctc ttctatccaa gtaraatgta ygccaggarc tccttccctc 502
tarcaatttc tactaaaatg tccaagtara atgtttcctt ttacaatcaa attactgtat 562
ttattaattt gctaraatcc aktaaatcat tttggtagct ctggctgtgc tatcaataaa 622
aagatgaaag caaaaaaaaa aaa 645



<210> 334
<211> 400
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 91..291

<221> sig_peptide
<222> 91..219
<223> Von Heijne matrix
score 3.79999995231628
seq LISVLYLIPKTLT/TN

<221> polyA_signal
<222> 367..372

<221> polyA_site
<222> 389..400

<400> 334

aacaaaagga gagttttata attcacttta aaaggagatt tgatggtaaa gtttaaagat 60 
taaaatattt tgttcttcaa ttacagagcg atg acc cca cag tat ctg cct cac 114
                                 Met Thr Pro Gln Tyr Leu Pro His
                                             -40

ggt gga aaa tac caa gtt ctt gga gat tac tct ttg gca gtg gtc ttc 162
Gly Gly Lys Tyr Gln Val Leu Gly Asp Tyr Ser Leu Ala Val Val Phe
-35                 -30                 -25                 -20

^'^.^^^^ ^ta att tct gtt tta tac ctt ata ccc aaa 210 

Pro Leu His Phe Ser Asp Leu Ile Ser Val Leu Tyr Leu Ile Pro Lys
                -15                 -10                 -5

aca ctt act acc aac aca gct gtt aaa cat tct ata caa aaa aat tgt 258
Thr Leu Thr Thr Asn Thr Ala Val Lys His Ser Ile Gln Lys Asn Cys
            1               5                   10

atg mat ctg gta tta gga aaa tta ctt tca cag taaatatcaa agaaaaaaga 311
Met Xaa Leu Val Leu Gly Lys Leu Leu Ser Gln
    15                  20

ttaagggtct ctttgccatg cttttcatca tatgcaccaa atgtaaattt tgtacaataa 371
attttattt cctaagyaaa aaaaaaaaa 400



<210> 335
<211> 496
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 196..384

<221> sig_peptide
<222> 196..240
<223> Von Heijne matrix
score 6.69999980926514
seq ILSTVTALTFARA/LD

<221> polyA_signal
<222> 461..466

<221> polyA_site
<222> 485..496

<400> 335

aaaaaattgg tcccagtttt caccctgccg cagggctggc tggggagggc agcggtttag 60 
attagccgtg gcctaggccg tttaacgggg tgacacgagc htgcagggcc gagtccaagg 120 
cccggagata ggaccaaccg tcaggaatgc gaggaatgtt tttcttcgga ctctatcgag 180 
gcacacagac agacc atg ggg att ctg tct aca gtg aca gcc tta aca ttt 231
                 Met Gly Ile Leu Ser Thr Val Thr Ala Leu Thr Phe
                 -15                 -10                 -5

gcc aga gcc ctg gac ggc tgc aga aat ggc att gcc cac cct gca agt 279
Ala Arg Ala Leu Asp Gly Cys Arg Asn Gly Ile Ala His Pro Ala Ser
            1               5                   10

gag aag cac aga ctc gag aaa tgt agg gaa ctc gag agc agc cac tcg 327
Glu Lys His Arg Leu Glu Lys Cys Arg Glu Leu Glu Ser Ser His Ser
    15                  20                  25

gcc cca gga tca acc cag cac cga aga aaa aca acc aga aga aat tat 375
Ala Pro Gly Ser Thr Gln His Arg Arg Lys Thr Thr Arg Arg Asn Tyr
30                  35                  40                  45

tct tca gcc tgaaatgaak ccgggatcaa atggttgctg atcaragccc 424
Ser Ser Ala

^'■ip^^^^t^^^t tggaaaagtc aaattgasea ttattaaata aagcttgttt aatatgtctc 484 
kaacaaaaaa aa 496



<210> 336
<211> 968
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 54..590

<221> sig_peptide
<222> 54..227
<223> Von Heijne matrix
score 3.5
seq GGILMGSFQGTIA/GQ

<221> polyA_site
<222> 955..965

<400> 336

atatttgccc cttactttat cttgtgcctt gagaaattgc tggggagaga ggt atg 56
                                                           Met

tcc act ggg cag ctg tac agg atg gag gat ata ggg cgt ttc cac tcc 104
Ser Thr Gly Gln Leu Tyr Arg Met Glu Asp Ile Gly Arg Phe His Ser
        -55                 -50                 -45

cag cag cca ggt tcc ctc acc cca agc tca ccc act gtt ggg gag att 152
Gln Gln Pro Gly Ser Leu Thr Pro Ser Ser Pro Thr Val Gly Glu Ile
    -40                 -35                 -30

atc tac aat aac acc aga aac aca ttg ggg tgg att ggg ggt atc ctt 200
Ile Tyr Asn Asn Thr Arg Asn Thr Leu Gly Trp Ile Gly Gly Ile Leu
-25                 -20                 -15                 -10
O
1
5
get
\jxy
VjXU
Pro
TV T _
o n
etc
y '-y


y 
caa


gee 


gtc 


age 
eag


ggw 


ate 


tac 


344 


Gly 


Ijeu 




V dx 


i-iX Ct 


It X U 


fro 




Til a 

A±a 
Ser


Leu 


bin 


Gly 


lie 


Tyr 
















^ u 
3.CO




L. 


tgg 


Ctg 




eag 


c uU 


4-4-4- 


cac 


tec 


act 


gee 


eta 


rgg 


gna 


392 






ir -L 


irp 


Leu 


Leu 


\j±n 


Leu 


Fne 


xilS 


Ser 


Thr 


Ala 


Leu 


Xaa 


Xaa 
























50 










55 






cag 


caa 


cct 


4- 

aac 




uct 


eta 


tct 


ctg 


aac 


ate 


tct 


tea 


tee 


cat 


440 










Asn 


oxy 


Ser 


Leu 


Ser 


Leu 


Asn 


lie 


Ser 


Ser 


Ser 


His 












D u 










d5 










70 






CTCt 


CCIT 


rat 


cca 


X ^ct. 






ace 


ctg 


gaa 
gga




gae 


cct 


ace 


488 


Ala 


PlTO 


Xaa 


It J. U 


^ctd 
Tlir


Leu 


vaXU 


Pro 


bxy 


val 


Asp 


Pro 


Thr 
85












gtc 


tgt 


att 


aat 


ccc 


cat 


ccc 


cca 


cca 


cca 


ate 


tta 


aaa 


abc 


536 


Arg 


Xaa 


Val 


Cys 


He 


Asn 


Pro 


His 


Pro 


Pro 


Pro 


Pro 


He 


Leu 


Lys 


Xaa 








90 










95 










100 








;.(;;CCt 


ctg 


tec 


ccc 


tac 


cct 


aaa 


ccc 


eag 


tta 


ggt 


ace 


eat 


get 


ggg 


caa 


584 




Leu 


Ser 


Pro 


Tyr 


Pro 


Lys 


Pro 


Gin 


Leu 


Gly 


Thr 


His 


Ala 


Gly 


Gin 






105 
115










1 ^%tc 


aat 


taacaattta tgcacaggta ctagttttat 


. tgtattaceg 


ttceagggta 


640 


Hval 


Asn 

































ifll20 

gctttgaaaa aagtatctca aaaaggcaac atgggccgag cgcagtggct cacgcctgta 700
atcccagcac tttgggaggc caaggtgggc agatcgcctg aggtctggag ttcaagacca 760
cctggccaa cagggtgaaa ccccgtctct acaaaaatar gaaaattrgc caggtgtggt 820
ggcagacgtc tgtrgtccca gctattcagg agactgaggc acgagaattc catgaaccca 880
ggatgcggag gttgcagtga gccgagattg tgccactgcg ctccagcctg ggcgacagag 940
ggtattctg tttcaaaaaa aaaaamcm 968



<210> 337
<211> 901
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 133..846

<221> sig_peptide
<222> 133..345
<223> Von Heijne matrix
score 9.39999961853027
seq VVSFLLLLAGLIA/TY

<221> polyA_site
<222> 890..901

<400> 337

aagcagcttc caggatcctg agatccggag cagccggggt cggagcggct cctcaagagt 60
tactgatcta tnnatggcag agaaaaaaaa attgtgacca gagacgtgta gcaatgaaca 120
aggaacrtca ta atg rwn nnk ttc aca gac ccc tct tca gtg aat gaa aag 171
              Met Xaa Xaa Phe Thr Asp Pro Ser Ser Val Asn Glu Lys
                  -70                 -65                 -60

aag agg agg gag cgg gaa gaa agg cag aat att gtc ctg tgg aga cag 219
Lys Arg Arg Glu Arg Glu Glu Arg Gln Asn Ile Val Leu Trp Arg Gln
            -55                 -50                 -45

ccg ctc att acc ttg cag tat ttt tct ctg gaa atc ctt gta atc ttg 267
Pro Leu Ile Thr Leu Gln Tyr Phe Ser Leu Glu Ile Leu Val Ile Leu
        -40                 -35                 -30

aag gaa tgg acc tca aaa tta tgg cat cgt caa agc att gtg gtg tct 315
Lys Glu Trp Thr Ser Lys Leu Trp His Arg Gln Ser Ile Val Val Ser
    -25                 -20                 -15

ttt tta ctg ctg ctt gct ggg ctt ata gct acg tat tat gtt gaa gga 363
Phe Leu Leu Leu Leu Ala Gly Leu Ile Ala Thr Tyr Tyr Val Glu Gly
-10                 -5                  1               5

gtg cat caa cag tat gtg caa cgt ata gag aaa cag ttt ctt ttg tat 411
Val His Gln Gln Tyr Val Gln Arg Ile Glu Lys Gln Phe Leu Leu Tyr
            10                  15                  20

gcc tac tgg ata ggc tta gga att ttg tct tct gtt ggg ctt gga aca 459
Ala Tyr Trp Ile Gly Leu Gly Ile Leu Ser Ser Val Gly Leu Gly Thr
        25                  30                  35

ggg ctg cac acc ttt ctg ctt tat ctg ggt cca cat ata gcc tca gtt 507
Gly Leu His Thr Phe Leu Leu Tyr Leu Gly Pro His Ile Ala Ser Val
    40                  45                  50

aca tta gct gct tat gaa tgc aat tca gtt aat ttt ccc gaa cca ccc 555
Thr Leu Ala Ala Tyr Glu Cys Asn Ser Val Asn Phe Pro Glu Pro Pro
55                  60                  65                  70

tat cct gat cag att att tgt cca gat gaa gag ggc act gaa gga acc 603
Tyr Pro Asp Gln Ile Ile Cys Pro Asp Glu Glu Gly Thr Glu Gly Thr
                75                  80                  85

att tct ttg tgg agt atc atc tca aaa gtt agg att gaa gcc tgc atg 651
Ile Ser Leu Trp Ser Ile Ile Ser Lys Val Arg Ile Glu Ala Cys Met
            90                  95                  100

tgg ggt atc ggt aca gca atc gga gag ctg cct cca tat ttc atg gcc 699
Trp Gly Ile Gly Thr Ala Ile Gly Glu Leu Pro Pro Tyr Phe Met Ala
        105                 110                 115

y.kga gca gct cgc ctc tca ggt gct gaa cca gat gat gaa gag tat cag 747
Arg Ala Ala Arg Leu Ser Gly Ala Glu Pro Asp Asp Glu Glu Tyr Gln
    120                 125                 130

gaa ttt gaa gag atg ctg gaa cat gca gag tct gca caa gta aga aca 795
Glu Phe Glu Glu Met Leu Glu His Ala Glu Ser Ala Gln Val Arg Thr
135                 140                 145                 150

gtg ggg ata gaa aat aga aca ctt tac ttc ttc cta aag agg cta tta 843
Val Gly Ile Glu Asn Arg Thr Leu Tyr Phe Phe Leu Lys Arg Leu Leu
                155                 160                 165

agg taaaattgtt agtagttact ctgaagaaga aaactgctaa agtaaaaaaa aaaaa 901
Arg



<210> 338
<211> 1347
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 138..671

<221> sig_peptide
<222> 138..248
<223> Von Heijne matrix
score 3.5
seq LVFNFLLILTILT/IW

<221> polyA_signal
<222> 1319..1324

<221> polyA_site
<222> 1338..1347

<400> 338

aagaatgctt gtgaagtagc aactaaagtg gcagtgtttc ttctgaaatt ctcaggcagt 60 
cagactgtct taggcaaatc ttgataaaat agcccttatc caggttttta tctaaggaat 120 
cccaagaaga ctgggga atg gag aga cag tca agg gtt atg tca gaa aag 170
                   Met Glu Arg Gln Ser Arg Val Met Ser Glu Lys
                           -35                 -30



gat 


gag 


tat 


cag 


ttt 


caa 


cat 


cag 


gga 
gtg
I "^ksn
He
-5
He
He
1
5
314
aya


atg 


gga 


eta 


att 


tta 


esa 
Xaa
Gly
He
Thr
He








25 
eta
cca


tea 


410 


risiu 


Ser 


Gly Xaa 


Val 


Tyr 


Asp 


Cys 


Val 


Lys 


Leu 


Thr 
Ser


Pro 


Ser 
50












act ctg ctg gtt aat atc act gac caa gtt tat gar tat aaa tac aar 458
Thr Leu Leu Val Asn Ile Thr Asp Gln Val Tyr Glu Tyr Lys Tyr Lys
                    60                  65                  70

aga gaa ata agt cag cac amc atc aat cct cat cam gga aat gct ata 506
Arg Glu Ile Ser Gln His Xaa Ile Asn Pro His Xaa Gly Asn Ala Ile
                75                  80                  85

ctt gaa aag atg aca ttt gat cca raa atc ttc ttc aat gtt tta ctg 554
Leu Glu Lys Met Thr Phe Asp Pro Xaa Ile Phe Phe Asn Val Leu Leu
            90                  95                  100

cca cca att ata ttt cat gca gga tat agt cta aag aag aga cac ttt 602
Pro Pro Ile Ile Phe His Ala Gly Tyr Ser Leu Lys Lys Arg His Phe
        105                 110                 115

ttt caa aac tta gga tct att tta acg tat gcc ttc ttg gga act gcc 650
Phe Gln Asn Leu Gly Ser Ile Leu Thr Tyr Ala Phe Leu Gly Thr Ala
    120                 125                 130

atc tcc tgc atc gtc ata ggg taagtgacat tcggagctca agttgcaggt 701
Ile Ser Cys Ile Val Ile Gly
135                 140

ggctgtgggg tcygtgatct gtgtgaggga tctaacactt ccaggattct tgctggckgg 761
gaaaattgtc ttttttttar tawatcacaw atttgtatgt tttttcwgac ttaattccac 821
ggcttckgam aaatacaagg cttcaaatca aagcaaacta waggattgct ggactttctc 881
tgtgagttct ggacttctga cttagggaat gtggatcact tgccttgagt tatgtgaagc 941
gcattgcatt cttcttttag tttgagtaat sccgatatgc tcactgcatt cttttttgtc 1001

ttgtattgag agaccttacc tgtatttggc aggagtgcaa aagtaactat atgccaagag 1061 

ttttctttct aaaggaaagt ttacaagaca gcagtctgaa acagatatgt ccaaatatca 1121 

acagagttgc ttaatacagg gatagctttt cagttaatac cctgtagaat gcagactctt 1181 

tttttcattg tattttcttg attatgctac tgagccctaa gtcacacgtt atatactctg 1241 

gcttgcagct catcataaag taaaatgtgg taccaaatgg tgaaggcaat ccagcctctg 1301 

ataatcccgt ccaatacatt aaagctccac tgcaggaaaa aaaaaa 1347 



<210> 339
<211> 987
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 124..411

<221> sig_peptide
<222> 124..186
<223> Von Heijne matrix
score 6.30000019073486
seq MVALCCCLWKISG/CE

<221> polyA_signal
<222> 948..953

<221> polyA_site
<222> 971..983

<400> 339

nfagacgctgc ctttagggag agataaaaag cataatgaca ttagctagga aagttaattt 60 
y^cagttctta ctgaagtgct gtatgaaact gaaatttcca aggaactgaa ttttgtgagc 120 



paa atg agc atg caa ttc ttg ttt aag atg gtg gcc tta tgc tgt tgt 168
    Met Ser Met Gln Phe Leu Phe Lys Met Val Ala Leu Cys Cys Cys
        -20                 -15                 -10

ctc tgg aag atc tcc ggc tgt gag gaa gtc cct cta act tac aac ctg 216
Leu Trp Lys Ile Ser Gly Cys Glu Glu Val Pro Leu Thr Tyr Asn Leu
    -5                  1               5                   10

ctc aag tgc ctc cta gat aaa gcg cac tgt gta ctc ctg aca cct tgt 264
Leu Lys Cys Leu Leu Asp Lys Ala His Cys Val Leu Leu Thr Pro Cys
                15                  20                  25

ggt tac atc ttt tcc ttg atc agt cca gaa att ctc aaa ctc act tta 312
Gly Tyr Ile Phe Ser Leu Ile Ser Pro Glu Ile Leu Lys Leu Thr Leu
            30                  35                  40

atc act ttg cav atc ctc tta ata ctc aaa aat cta cac tta ctg tgg 360
Ile Thr Leu Xaa Ile Leu Leu Ile Leu Lys Asn Leu His Leu Leu Trp
        45                  50                  55

ctg aca gtt tca agc awa tgt gtt cat cgc agt agt gca aga aaa gaa 408
Leu Thr Val Ser Ser Xaa Cys Val His Arg Ser Ser Ala Arg Lys Glu
    60                  65                  70

aag tagaagaacc ctgcagagat ttgatggaac ccagcttcta ttcattaaaa 461
Lys
75

ccaatggcaa aatataaagc aaataggagg tgacgaaggt tacaaaaata cgtattgttt 521
atgttttccc tggggtgtgc tgattgtcag gcatcagttc cctgtgccat tcattcccca 581
acacagcatg catcagaaat tttatcaata aatgctttct ctctcaatgt tcaacctatg 641

ctgatagacc attaaataca gtttttgggt tcacagcttg tcatcatcat ttgtctatac 701 

ctgtggcaaa gaatatctaa taagatactc tcagcatttt gcacacttaa actaagatgc 761 

tgaatgctgt attttacgga ataatcagcc acattaaatt tggagactca acaagcatgc 821 

tgtgaacatt caacattagg tttaaatttt atttttaaaa gttaataata aaaggatata 881 

tgttaagtat tatgaaaccc tgcatatact gtaataaaat ggtggatgtg aatggacaat 941 

atatgcaata aaatttataa tttgattcya aaaaaaaaaa aamccv 987 



<210> 340
<211> 748
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 372..494

<221> sig_peptide
<222> 372..443
<223> Von Heijne matrix
score 5.30000019073486
seq RILLLHFYCLLRS/SE

<221> polyA_signal
<222> 708..713

<221> polyA_site
<222> 732..745

<400> 340

j;^ pcatgaaatg tgcttggtct gtgatctctt ggtcagatat ctgccttcca ggcgatcctt 60 
j'?^gaggttgtg taattcagct ggccctggct cctggtccct gttactgagc tgggcagtcg 12 0 
["'aaccgaaggc agatgagctc aagatcatgc cttgggaagc atggtgctct aggggtgcct 180 
''^€tttattcct ttcattgtat tatagactgt ttccaagttt atggttagaa atggtaaagt 240 
yfeggtctggtg ttttgaggta gaacccagcc tagggcaaga tatgaactgt tcttgaggta 300 
yi&aaatgtcta cagtcagttg tttcatctag cttgcatctt aaaacacaaa cccttcagtt 360 
gctttcactt a atg cac aca ttt gcc aat gac aga ggg tta tac agg atc 410
             Met His Thr Phe Ala Asn Asp Arg Gly Leu Tyr Arg Ile
                             -20                 -15

ctt ctt tta cat ttc tat tgt ctg cta cgc tca tca gag tat att ttg 458
Leu Leu Leu His Phe Tyr Cys Leu Leu Arg Ser Ser Glu Tyr Ile Leu
    -10                 -5                  1               5

ggg tac aag gtt ttg ggg gtt ttt tty ccc att ttg taactgcctt 504
Gly Tyr Lys Val Leu Gly Val Phe Phe Pro Ile Leu
                10                  15

attgaaaadt aaktgccctt ccattccagg cctcctcata ttgtacttgt ttcctgccaa 564
atctggggga tcatttgtat tttaactttg taatctatgg ctctgtactg ttgaaagstc 624
tcaattctgt ggggtctcct tagtatgtat gtgacttttc atgttgcaat atcacacgat 684
gggatggccc gacttttgct cttaataaat aatctgaatg agtaagaraa aaaaaaaaaa 744



<210> 341
<211> 1106
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 112..450

<221> sig_peptide
<222> 112..192
<223> Von Heijne matrix
score 7.19999980926514
seq SLLFFLLLEGGXT/EQ

<221> polyA_signal
<222> 1053..1058

<221> polyA_site
<222> 1095..1106

<400> 341

aagacctcgg aacgagagcg ccccggggag ctcggagcgc gtgcacgcgt ggcavacgga 60 
^;.,paaggcvakk rcnnnnrctt gaaggttctg tcaccttttg cagtggtcca a atg aga 117 



Met Arg 



raa aag tgg aaa atg gga ggc atg aaa tac atc ttt tcg ttg ttg ttc 165
Xaa Lys Trp Lys Met Gly Gly Met Lys Tyr Ile Phe Ser Leu Leu Phe
                    -20                 -15                 -10

ttt ctt ttg cta gaa gga ggc kaa aca gag caa gtr amn cat tca gag 213
Phe Leu Leu Leu Glu Gly Gly Xaa Thr Glu Gln Val Xaa His Ser Glu
                -5                  1               5

aca tat tgc atg ttt caa gac aag aag tac aga gtg ggt gag aga tgg 261
Thr Tyr Cys Met Phe Gln Asp Lys Lys Tyr Arg Val Gly Glu Arg Trp
        10                  15                  20

cat cct tac ctg gaa cct tat ggg ttg gtt tac tgc gtg aac tgc atc 309
His Pro Tyr Leu Glu Pro Tyr Gly Leu Val Tyr Cys Val Asn Cys Ile
    25                  30                  35












tea 


gag 


aat 


ggg 


aat 


gtg 


ctt 


tgc 


age 


ega 


gtc 


aga 


tgt 


cca 


aat 


357 


Ser 


Glu 


Asn 


Gly 


Asn 


Val 


Leu 


Cys 


Ser 


Arg 


Val 


Arg 


Cys 


Pro 


Asn 














45 










50 










55 




gtt cat tgc ctt tct cct gtg cat att cct cat ctg tgc tgc cct cgc 405
Val His Cys Leu Ser Pro Val His Ile Pro His Leu Cys Cys Pro Arg
                60                  65                  70

tgc cca gaa gac tcc tta ccc cca gtg aac aat rwg gtg acc agc 450
Cys Pro Glu Asp Ser Leu Pro Pro Val Asn Asn Xaa Val Thr Ser
            75                  80                  85

tagtcttgck agtacaatgg gacaacttac caacatggas agctgttcgt agctgrrggg 510
ctctttcaga atcggcaacc cmatcaatgc acccagtgca gctgttcgga rggaaacktg 570
tattgtggtc tcaagacttg ccccaaatta acctgtgcct tcccagtctc tgttccarat 630
tcctgctgcc gggtwtgcag argagatgga caactgtcat gggaacmttc tgatggtgat 690
atcttccggc aacctgccaa cagagaagca agacattctt accaccgctc tcactatgat 750
cctccaccaa gccgacaggc tggaggtctg tcccgctttc ctggggccag aagtcaccgg 810
ggagctctta tggattccca gcaagcatca ggaaccattg tgcaaattgt catcaataac 870
aaacacaagc atggacaagt gtgtgtttcc aatggaaaga cctattctca tggcgagtcc 930
tggcacccaa acctccgggc atttggcatt gtggagtgtg tgctatgtac ttgtaatgtc 990
accaagcaag agtgtaagaa aatccactgc cccaatcgat acccctgcaa gtatcctcaa 1050
aaaatagacg gaaaatgctg caaggtgtgt ccaggtaaaa aagcaaaaaa aaaaaa 1106
<210> 342
<211> 1191
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 117..866

<221> sig_peptide
<222> 117..170
<223> Von Heijne matrix
score 10.6999998092651
seq LILLALATGLVGG/ET

<221> polyA_signal
<222> 1159..1164

<221> polyA_site
<222> 1178..1190

<400> 342

:;:^^aaacccagc ctacctgctg tagctgccgc cactgccgtc tccgccgcca ctggwccccc 60 
^^%gagcbnmag ccccagagcc taggaacctg gggcccgctc ctcccccctc caggcc atg 119 



Met 



kgg att ctg cag tta atc ctg ctt gct ctg gca aca ggg ctt gta ggg 167
Arg Ile Leu Gln Leu Ile Leu Leu Ala Leu Ala Thr Gly Leu Val Gly
-15 -10 -5










gag acc agg atc atc aag ggg ttc gag tgc aag cct cac tcc cag 215
Glu Thr Arg Ile Ile Lys Gly Phe Glu Cys Lys Pro His Ser Gln
1 5 10 15




ccc tgg cag gca gcc ctg ttc gag aag acg cgg cta ctc tgt ggg gcg 263
Pro Trp Gln Ala Ala Leu Phe Glu Lys Thr Arg Leu Leu Cys Gly Ala
20 25 30






acg ctc atc gcc ccc aga tgg ctc ctg aca gca gcc cac tgc ctc aag 311
Thr Leu Ile Ala Pro Arg Trp Leu Leu Thr Ala Ala His Cys Leu Lys
35 40 45






ccc cgc tac ata ktt cac ctg ggg cag cac aac ctc cag aag gag gag 359
Pro Arg Tyr Ile Xaa His Leu Gly Gln His Asn Leu Gln Lys Glu Glu
50 55 60








ggc tgt gag car acc cgg aca gcc act gag tcc ttc ccc cac ccc ggc 407
Gly Cys Glu Gln Thr Arg Thr Ala Thr Glu Ser Phe Pro His Pro Gly
65 70 75










ttc aac aac agc ctc ccc aac aaa gac cam mgc aat gac atc atg ctg 455
Phe Asn Asn Ser Leu Pro Asn Lys Asp Xaa Xaa Asn Asp Ile Met Leu
80 85 90 95

gtg aak atg gtna tcg cca gtc tcc atc acc tgg gct gtg cga ccc ctc 503
Val Xaa Met Xaa Ser Pro Val Ser Ile Thr Trp Ala Val Arg Pro Leu
100 105 110






acc ctc tcc tca cgc tgt gtc act gct ggc acc agc tgc ctc att tcc 551
Thr Leu Ser Ser Arg Cys Val Thr Ala Gly Thr Ser Cys Leu Ile Ser
115 120 125








ggc tgg ggc agc acg tcc agc ccc cag tta cgc ctg cct cac acc ttg 599
Gly Trp Gly Ser Thr Ser Ser Pro Gln Leu Arg Leu Pro His Thr Leu
130 135 140










cga tgc gcc aac atc acc atc att gag cac cag aag tgt gag aac gcc 647
Arg Cys Ala Asn Ile Thr Ile Ile Glu His Gln Lys Cys Glu Asn Ala
150 155




tac gy c aca gac acc atg gtg tgt gcc agc gtg cag gaa 695
Tyr Pro Gly Asn Ile Thr Asp Thr Met Val Cys Ala Ser Val Gln Glu
165 170 175








aag gac tcc tgc cag gac tcc ggg ggc cct ctg gtc tgt 743
Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys
180 185 190




aac cag tct ctt caa ggc att atc tcc tgg ggc cag gat ccg tgt gcg 791
Asn Gln Ser Leu Gln Gly Ile Ile Ser Trp Gly Gln Asp Pro Cys Ala
195 200 205




atc acc cga aag cat ggt gtc tac acg aaa gtc tgc aaa tat gtg gac 839
Ile Thr Arg Lys Pro Gly Val Tyr Thr Lys Val Cys Lys Tyr Val Asp
210 215 220








tgg atc cag gag acg atg aag aac aat tagactggac ccacccacca 886
Trp Ile Gln Glu Thr Met Lys Asn Asn
225 230



cagcccatca ccctccattt ccacttggtg tttggttcct gttcactctg ttaataagaa 946
accctaagcc aagaccctct acgaacattc tttgggcctc ctggactaca ggagatgctg 1006
tcacttaata atcaacctgg ggttcgaaat cagtgagacc tggattcaaa ttctgccttg 1066
aaatattgtg actctgggaa tgacaacacc tggtttgttc tctgttgtat ccccagcccc 1126
paakwcagct cctggccata tatcaaggtt tcaataaata tttgctaaat gaawaaaaaa 1186

.sf^aaac 1191 



<210> 343
<211> 1070
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 13..465

<221> sig_peptide
<222> 13..75
<223> Von Heijne matrix
score 3.90000009536743
seq PVAVTAAVAPVLS/IN

<221> polyA_signal
<222> 1035..1040

<221> polyA_site
<222> 1060..1070

<400> 343

agagtcggga aa atg gct gcg agt acc tcc atg gtc ccg gtg gct gtg acg 51
Met Ala Ala Ser Thr Ser Met Val Pro Val Ala Val Thr
-20 -15 -10

gcg gca gtg gcg cct gtc ctg tcc ata aac agc gat ttc tca gat ttg 99
Ala Ala Val Ala Pro Val Leu Ser Ile Asn Ser Asp Phe Ser Asp Leu
-5 1 5

cgg gaa att aaa aag caa ctg ctg ctt att gcg ggc ctt acc cgg gag 147
Arg Glu Ile Lys Lys Gln Leu Leu Leu Ile Ala Gly Leu Thr Arg Glu
10 15 20
cgg ggc cta cta cac agt agc aaa tgg tcg gcg gag ttg gct ttc tct 195
Arg Gly Leu Leu His Ser Ser Lys Trp Ser Ala Glu Leu Ala Phe Ser
25 30 35 40




ctc cct gca ttg cct ctg gcc gag ctg caa ccg cct ccg cct att aca 243
Leu Pro Ala Leu Pro Leu Ala Glu Leu Gln Pro Pro Pro Pro Ile Thr
45 50 55




gag gaa gat gcc cag gat atg gat gcc tat acc ctg gcc aag gcc tac 291
Glu Glu Asp Ala Gln Asp Met Asp Ala Tyr Thr Leu Ala Lys Ala Tyr
60 65 70




ttt gac gtt aaa gag tat gat cgg gca gca cat ttc ctg cat ggc tgc 339
Phe Asp Val Lys Glu Tyr Asp Arg Ala Ala His Phe Leu His Gly Cys
75 80 85




aat gca aga aaa gcc tat ttt ctg tat atg tat tcc aga tat ctg gtg 387
Asn Ala Arg Lys Ala Tyr Phe Leu Tyr Met Tyr Ser Arg Tyr Leu Val
90 95 100




agg gcc att tta aaa tgt cat tct gcc ttt agt gaa aca tcc ata ttt 435
Arg Ala Ile Leu Lys Cys His Ser Ala Phe Ser Glu Thr Ser Ile Phe
105 110 115 120




aga acc aat gga aaa gtt aaa tct ttt aaa tagcttagca gtgggccact 485
Arg Thr Asn Gly Lys Val Lys Ser Phe Lys
125 130

















l^i^aatgaatgt aetttataea tageaataat aaaaaaaaga tateataaat aaagttaaaa 545 

IjJf ggatggtaa aaaaaaaaat attcttagga atgactaaca ggataagtaa caacetgatt 605 

.l^'^tttatttae tttaggttat ataaggttct teatgcetgt gaattaatat tattgtgtaa 665 

[!&aattaagtt aaaaageetg ggetgacttt taaatttata aattcattta tcatgtttat 725 

fiifgtatattta ttgtttttct tteatggcta ttaaaaagta tgactgtaaa ggaeaatgea 785 

^^j^gtaaaeeaa ettaatactg tattgaataa taagtacaat ttattatttt aetttgaaac 845 

attatgaatt tactttccta ctttttctta gttgttatct atataaattg attaaaaaaa 905

y^attttatgt aettcteatt tectagtaca ggttgagtat eccttatttg aagtgcttgg 965 

y%aeeaaaagt gtttcagatt tcagattttt tteagatttt ggtatatttg cattatactt 1025 

actggttgaa ataaaaaatg ctgcagtgag tgtcaaaaaa aaaaa 1070



<210> 344
<211> 1213
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 2..718

<221> sig_peptide
<222> 2..76
<223> Von Heijne matrix
score 3.90000009536743
seq RVGLLLGGGGVYG/SR

<221> polyA_signal
<222> 1170..1175

<221> polyA_site
<222> 1203..1213

<400> 344

a atg ccc cgg aag cgg aag tgc gat ctt cgg gct gtc aga gtt ggt ctg 49
Met Pro Arg Lys Arg Lys Cys Asp Leu Arg Ala Val Arg Val Gly Leu
-25 -20 -15 -10

tta ctc ggt ggt ggc gga gtc tac gga agc cgt ttt cgc ttc act ttt 97
Leu Leu Gly Gly Gly Gly Val Tyr Gly Ser Arg Phe Arg Phe Thr Phe
-5 1 5

cct ggc tgt aga gcg ctt tcc ccc tgg cgg gtg aga vtg cag aga cga 145
Pro Gly Cys Arg Ala Leu Ser Pro Trp Arg Val Arg Xaa Gln Arg Arg
10 15 20

agg tgc gag atg agc act atg ttc gcg gac act ctc ctc atc gtt ttt 193
Arg Cys Glu Met Ser Thr Met Phe Ala Asp Thr Leu Leu Ile Val Phe
25 30 35

atc tct gtg tgc acg gct ctg ctc gca gag ggc ata acc tgg gtc ctg 241
Ile Ser Val Cys Thr Ala Leu Leu Ala Glu Gly Ile Thr Trp Val Leu
40 45 50 55

gtt tac agg aca gac aag tac aag aga ctg aag gca gaa gtg gaa aaa 289
Val Tyr Arg Thr Asp Lys Tyr Lys Arg Leu Lys Ala Glu Val Glu Lys
60 65 70

cag agt aaa aaa ttg gaa aag aag aag gaa aca ata aca gag tca gct 337
Gln Ser Lys Lys Leu Glu Lys Lys Lys Glu Thr Ile Thr Glu Ser Ala
75 80 85

ggt cga caa cag aaa aar aaa ata gag aga cdd kaa kas amc ctg arg 385
Gly Arg Gln Gln Lys Lys Lys Ile Glu Arg Xaa Xaa Xaa Xaa Leu Xaa

aat aac aac aga gat cta tca atg gtt cga atg aaa tcc atg ttt gct 433
Asn Asn Asn Arg Asp Leu Ser Met Val Arg Met Lys Ser Met Phe Ala

H^^^ ^tg gg^ ^tg ttc aat tcc ata ttt 481
Ile Gly Phe Cys Phe Thr Ala Leu Met Gly Met Phe Asn Ser Ile Phe
120 125 130 135

gat ggt aga gtg gtg gca aag ctt cct ttt acc cct ctt tct tas rtc 529
Asp Gly Arg Val Val Ala Lys Leu Pro Phe Thr Pro Leu Ser Xaa Xaa
140 145 150

pra gga ctg tct cat cga aat ctg ctg gga gat gac acc aca gac tgt 577
Xaa Gly Leu Ser His Arg Asn Leu Leu Gly Asp Asp Thr Thr Asp Cys
155 160 165

'jfccc ttc att ttc ctg taw att ctc tgt act atg tcg att cga cag aac 625
Ser Phe Ile Phe Leu Xaa Ile Leu Cys Thr Met Ser Ile Arg Gln Asn
170 175 180

att cag aag att ctc ggc ctt gcc cct tca cga gcc gcc acc aag cag 673
Ile Gln Lys Ile Leu Gly Leu Ala Pro Ser Arg Ala Ala Thr Lys Gln
185 190 195

gca ggt gga ttt ctt ggc cca cca cct cct tct ggg aag ttc tct 718
Ala Gly Gly Phe Leu Gly Pro Pro Pro Pro Ser Gly Lys Phe Ser
200 205 210

tgaactcaag aactctttat tttctakcat tctttctaga cacacacaca tcagactggc 778
aactgttttg tascaagagc cataggtagc cttackactt gggcctcttt ctagttttga 838
attatttcta agccttttgg gtatkattag agtgaaaatg gcagccagca aacttgatag 898
tgcttttggt cctagatgat ttttatcaaa taagtggatt gattagttaa gttcaggtaa 958
tgtttatgta atgaaaaaca aatagcatcc ttcttgtttc atttacataa gtattttctg 1018
tgggaccgac tctcaaggca ctgtgtatgc cctgcaagtt ggctgtctat gagcatttag 1078
agatttagaa gaaaaattta gtttgtttaa cccttgtaac tgtttgtttt gttgttgttt 1138
ttttttcaag ccaaatacat gacataarat caataaarag gccaaatttt tasctgtttt 1198
atgtaaaaaa aaaaa 1213



<210> 345
<211> 978
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 86..709

<221> sig_peptide
<222> 86..361
<223> Von Heijne matrix
score 6.30000019073486
seq LLMSILALIFIMG/NS

<221> polyA_signal
<222> 943..948

<221> polyA_site
<222> 963..973

<400> 345

p^paagcatcct tccctaggac tgctgtaagc tttgagcctc tagcaggaga catgcctcgg 60 
ggacgaaaga gtcggcgccg ccgta atg cga gag ccg cag aag aga acc gca 112
Met Arg Glu Pro Gln Lys Arg Thr Ala
-85

aca atc gca aaa tyc rrg gcs tva gag ggc ctc cga gac ccc tat ggc 160
Thr Ile Ala Lys Xaa Xaa Ala Xaa Glu Gly Leu Arg Asp Pro Tyr Gly
-80 -75 -70

cgc ctc tgt ggt agc gag cac ccc cga aga cca cct gag cgg ccc gag 208
Arg Leu Cys Gly Ser Glu His Pro Arg Arg Pro Pro Glu Arg Pro Glu
-65 -60 -55

gaa gac ccg agc act cca gag gag gcc tct acc acc cct gaa gaa gcc 256
Glu Asp Pro Ser Thr Pro Glu Glu Ala Ser Thr Thr Pro Glu Glu Ala
-50 -45 -40

tcg agc act gcc caa gca caa aag cct tca gtg ccc cgg agc aat ttt 304
Ser Ser Thr Ala Gln Ala Gln Lys Pro Ser Val Pro Arg Ser Asn Phe
-35 -30 -25 -20

cag ggc acc aag aaa agt ctc ctg atg tct ata tta gcg ctc atc ttc 352
Gln Gly Thr Lys Lys Ser Leu Leu Met Ser Ile Leu Ala Leu Ile Phe
-15 -10 -5

atc atg ggc aac agc gcc aag gaa gct ctg gtc tgg aaa gtg ctg ggg 400
Ile Met Gly Asn Ser Ala Lys Glu Ala Leu Val Trp Lys Val Leu Gly
1 5 10

aag tta gga atg cag cct gga cgt cas cac agc atc ttt gga gat ccg 448
Lys Leu Gly Met Gln Pro Gly Arg Xaa His Ser Ile Phe Gly Asp Pro
15 20 25

aag aar atc gtc aca gaa ran ttt gtg cgc aga ggg tac ctg att tat 496
Lys Lys Ile Val Thr Glu Xaa Phe Val Arg Arg Gly Tyr Leu Ile Tyr
30 35 40 45

ara ccg gtg ccc cgt abc agt ccg gtg gag tat gas ttc ttc tgg ggg 544
Xaa Pro Val Pro Arg Xaa Ser Pro Val Glu Tyr Xaa Phe Phe Trp Gly
50 55 60

ccc cga gca cac gtg gaa tcg agc ara ctg aaa stc wtg cat ttt gtg 592
Pro Arg Ala His Val Glu Ser Ser Xaa Leu Lys Xaa Xaa His Phe Val
65 70 75

gca agg gtt cgt aac cga tgc tct aaa gac tgg cct tgt aat tat gac 640
Ala Arg Val Arg Asn Arg Cys Ser Lys Asp Trp Pro Cys Asn Tyr Asp
80 85 90

tgg gat tcg gac gat gat gca gag gtt gag gct atc ctc aat tca ggt 688
Trp Asp Ser Asp Asp Asp Ala Glu Val Glu Ala Ile Leu Asn Ser Gly
95 100 105

gct arg ggt tat tcc gcc cct taagtaratc tgaggcagac ccttgggggt 739
Ala Xaa Gly Tyr Ser Ala Pro
110 115

gtaaaagaga gtcacaggta ccccaaggag tagatgccag ggtcctaagt tgaaaatgmt 799
gtcgattggg ggcgggggac actgtatttg atatttgtga tcagtgatca ttgttcaact 859
gcgaaataga gtgtttgctt ttgataatgg aaaattgtat tcgttttaaa attccgtttg 919
ttgagaataa caatatgttt aaaaatataa ttgaacaaat tttaaaaaaa aaaamcccy 978



<210> 346
<211> 810
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..320

<221> sig_peptide
<222> 63..179
<223> Von Heijne matrix
score 3.90000009536743
seq VLAIGLLHIVLLS/IP

<221> polyA_signal
<222> 771..776

<221> polyA_site
<222> 799..810

<400> 346

^Ijkgggaaeega teeegggceg ttgatettcg gecceaeacg aacageagag aggggeatea 60 
Ol&g atg aat gtk ggc aca gcg cac ags dag gtg aac ccc aac acg cgg 107
Met Asn Val Gly Thr Ala His Xaa Xaa Val Asn Pro Asn Thr Arg
-35 -30 -25




gtk atg aac agc cgt ggc atc tgg ctc tcc tac gtg ctg gcc atc ggt 155
Val Met Asn Ser Arg Gly Ile Trp Leu Ser Tyr Val Leu Ala Ile Gly
-20 -15 -10




ctc ctc cac atc gtg ctg ctg agc atc ccg ttt gtk agt gtc cct gtc 203
Leu Leu His Ile Val Leu Leu Ser Ile Pro Phe Val Ser Val Pro Val
-5 1 5

gtc tgg acc ctc acc aac ctc att cac aac atg ggc atg tat atc ttc 251
Val Trp Thr Leu Thr Asn Leu Ile His Asn Met Gly Met Tyr Ile Phe
10 15 20








ctg cac acg gtg aag ggg aca ccc ttt gag acc ccg gac cag ggc aag 299
Leu His Thr Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys
25 30 35 40





gcg agg ctg cta acc cac tgg tgagcagatg gattatgggg tccagttcac 350
Ala Arg Leu Leu Thr His Trp
45

ggcctctcgg aakttcttga ccatcacacc catcgtgctg tacttcctca ccagcttcta 410
cactaaktac raccaaatcc attttgtgct caacaccgtg tccctgatra gcgtgcttat 470
ccccaagctg ccccagctcc acggaktccg gatttttgga atcaataakt actgaaaktg 530

casccccttc ccctgcccag ggtggcaggg gaggggtagg gtaaaaggca tktgctgcaa 590 

chctgaaaac araaaraara rscctctgga cactgccara ratgggggtt gagcctctgg 650 

cctaatttcc cccctcgctt cccccagtag ccaacttgga gtagcttgta ytggggttgg 710 

ggtaggcccc ctgggctctg accttttctg aattttttga tcttttcctt ttgctttttg 770 

aatararact ccatggagtt ggtcatggaa aaaaaaaaaa 810 



<210> 347
<211> 771
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 299..418

<221> sig_peptide
<222> 299..379
<223> Von Heijne matrix
score 3.59999990463257
seq LTLLLITPSPSPL/LF

<221> polyA_signal
<222> 739..744

<221> polyA_site
<222> 762..771

<400> 347

accttgggct ccaaattcta gctcataaag atgcaagtkt tgcaatttcc tataaatggt 60

taagaaaaga gcaagctgtc cagagagtga gaagtttgaa aagagaggtg cataagagag 120

ij^aatgatgtc catttgagcc ccaccacgga ggttatgtgg tcccaaaagg aatgatggcc 180 

; pagcaattaa tttttcctcc tagttcttag cttgcttctg cattgattgg ctttacacaa 240 

'.Jptggcattta gtctgcatta cacaaataga cactaattta tttggaacaa gcagcaaa 298 

atg aga act tta ttt ggt gca gtc agg gct cca ttt agt tcc ctc act 346
Met Arg Thr Leu Phe Gly Ala Val Arg Ala Pro Phe Ser Ser Leu Thr
-25 -20 -15

ctg ctt cta atc acc cct tct ccc agc cct ctt cta ttt gat aga ggt 394
Leu Leu Leu Ile Thr Pro Ser Pro Ser Pro Leu Leu Phe Asp Arg Gly
-10 -5 1 5

ctg tcc ctc aga tca gca atg tct tagcccctct cctctcttcc attccttcct 448
Leu Ser Leu Arg Ser Ala Met Ser
10

gttggtactc atttcttcta acttttaata aacatttagg tataatacat tacagtaagt 508
gctatttaga tacaaactta aaacatacta tatattttaa ggatctaaga atcctttara 568
rrrggcacat gactgaagta cctcagctgc gcagcctgta accagttttt ttaatgtaaa 628
agtaaraatg ccagccttaa cctabccctg carataaaag ctaactttta ttaataccag 688
ccctgaataa tggcactaat ccacactctt ccttaragtg atgctggaaa aataaaatca 748
^gg^cttcag attaaaaaaa aaa 771



<210> 348
<211> 409
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 186..380

<221> sig_peptide
<222> 186..233
<223> Von Heijne matrix
score 4
seq FFLFLSFVLMYDG/LR

<221> polyA_signal
<222> 383..388

<221> polyA_site
<222> 396..409

<400> 348

ataaaagaag cagcaaatag aatttcccac aaagtaagtt gactctaaat cttaagtatt 60 

acctagtttt ttaaaggttt gaatataata atgcagtatt tgcagtataa aaaggaagga 120 

l«jptttgtagag aatcattttg gtgctcaagt ctcttagcag tgccttattg cctcatagca 180 

l^gaag atg ctg ggg ttt ttt ttg ttt ttg tcc ttt gta tta atg tat gat 230
Met Leu Gly Phe Phe Leu Phe Leu Ser Phe Val Leu Met Tyr Asp

ggt ttg cgc ctt ttt ggc att ctt tca aca tgt cgt gta cat cac acc 278
Gly Leu Arg Leu Phe Gly Ile Leu Ser Thr Cys Arg Val His His Thr
1 5 10 15

atg aat cag ttc cta att gat ata tct agc ttt acc tcc cga gtt aaa 326
Met Asn Gln Phe Leu Ile Asp Ile Ser Ser Phe Thr Ser Arg Val Lys
20 25 30

aaa aaa atc ttt tta ttt tat gcc ttc awa ggt tgc ycg ttt car agt 374
Lys Lys Ile Phe Leu Phe Tyr Ala Phe Xaa Gly Cys Xaa Phe Gln Ser
35 40 45

gcc aca taaataaaat gtttaacaaa aaaaaaaaa 409
Ala Thr



<210> 349
<211> 613
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 69..458

<221> sig_peptide
<222> 69..233
<223> Von Heijne matrix
score 4
seq AALCGISLSQLFP/EP

<221> polyA_signal
<222> 564..569

<221> polyA_site
<222> 602..613

<400> 349

aagaacctga gcagcctgtc ttcagacaga gagaggccca cggctgtttc ttgaaaytgg 60 
cgctggga atg gcc atg tgg aac agg cca tgb bag ang ctg cct cag cag 110 

Met Ala Met Trp Asn Arg Pro Xaa Xaa Xaa Leu Pro Gln Gln

-55 -50 -45 



cct cts sta gct gag ccc act gca gag ggg gag cca cac ctg ccc acg 158
Pro Leu Xaa Ala Glu Pro Thr Ala Glu Gly Glu Pro His Leu Pro Thr
-40 -35 -30












ggc cgg gas byg act gag gcc aac cgc ttc gcc tat gct gcc ctc tgt 206
Gly Arg Xaa Xaa Thr Glu Ala Asn Arg Phe Ala Tyr Ala Ala Leu Cys
-25 -20 -15 -10




ggc atc tcc ctg tcc cag tta ttt cct gaa ccc gaa cac agc tcc ttc 254
Gly Ile Ser Leu Ser Gln Leu Phe Pro Glu Pro Glu His Ser Ser Phe
-5 1 5

tgc aca gag ttc atg gca ggc ctg gtg ckm tgg ctg gag ttg tct gaa 302
Cys Thr Glu Phe Met Ala Gly Leu Val Xaa Trp Leu Glu Leu Ser Glu
10 15 20










gct gtc ttg cca acc atg act gct ttt gcg agc ggc ctg gga ggt gaa 350
Ala Val Leu Pro Thr Met Thr Ala Phe Ala Ser Gly Leu Gly Gly Glu
25 30 35














sca vma tgt gtt tgt tca aat ttt act gaa gga ccc cat ctt gaa 398
Xaa Xaa Cys Val Cys Ser Asn Phe Thr Glu Gly Pro His Leu Glu
45 50 55




gga cga ccc gac ggt gat cac tca gga cct tct gag ctt ctc act caa 446
Gly Arg Pro Asp Gly Asp His Ser Gly Pro Ser Glu Leu Leu Thr Gln
60 65 70







gga tgg gca cta tgacscccgg gccagagtcc tcgtttgcca catgacctcc 498
Gly Trp Ala Leu



fijPtgctccaag tgeecttgga ggagctggat gtecttgaaa agatgttect ggagagcctg 558 
aaggaaatca aagaagagga atctgaaatg gccgaggcat cccraaaaaa aaaaa 613



<210> 350
<211> 986
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 12..638

<221> sig_peptide
<222> 12..263
<223> Von Heijne matrix
score 4.19999980926514
seq ITMLQMLALLGYG/LF

<221> polyA_signal
<222> 951..956

<221> polyA_site
<222> 975..985

<400> 350

accctatcaa g atg gtc aac ttc ccc cag aaa att gca ggt gaa ctc tat 50
Met Val Asn Phe Pro Gln Lys Ile Ala Gly Glu Leu Tyr
-80 -75



gga cct ctc atg ctg gtc ttc act ctg gtt gct atc cta ctc cat ggg 98
Gly Pro Leu Met Leu Val Phe Thr Leu Val Ala Ile Leu Leu His Gly
-70 -65 -60










atg aag acg tct gac act att atc cgg gag ggc acc ctg atg ggc aca 146
Met Lys Thr Ser Asp Thr Ile Ile Arg Glu Gly Thr Leu Met Gly Thr
-55 -50 -45 -40




gcc att ggc acc tgc ttc ggc tac tgg ctg gga gtc tca tcc ttc att 194
Ala Ile Gly Thr Cys Phe Gly Tyr Trp Leu Gly Val Ser Ser Phe Ile
-35 -30 -25






tac ttc ctt gcc tac ctg tgc aac gcc cag atc acc atg ctg cag atg 242
Tyr Phe Leu Ala Tyr Leu Cys Asn Ala Gln Ile Thr Met Leu Gln Met
-20 -15 -10








ttg gca ctg ctg ggc tat ggc ctc ttt ggg cat tgc att gtc ctg ttc 290
Leu Ala Leu Leu Gly Tyr Gly Leu Phe Gly His Cys Ile Val Leu Phe
-5 1 5

atc acc tat aat atc cac ctc cgc gcc ctc ttc tac ctc ttc tgg ctg 338
Ile Thr Tyr Asn Ile His Leu Arg Ala Leu Phe Tyr Leu Phe Trp Leu
15 20 25




Jijitg gtg ggt gga ctg tcc aca ctg cgc atg gta gca gtg ttg gtg tct 386
Leu Val Gly Gly Leu Ser Thr Leu Arg Met Val Ala Val Leu Val Ser
30 35 40








acc gtg ggc ccc aca cad cgg mtg ctc ctc tgt ggc acc ctg gct 434
Arg Thr Val Gly Pro Thr Xaa Arg Xaa Leu Leu Cys Gly Thr Leu Ala
45 50 55








gcc cta cac atg ctc ttc ctg ctc tat ctg cat ttt gcc tac cac aaa 482
Ala Leu His Met Leu Phe Leu Leu Tyr Leu His Phe Ala Tyr His Lys
60 65 70








dtg gta dag ggg atc ctg gac aca ctg gag ggc ccc aac atc ccg ccc 530
Xaa Val Xaa Gly Ile Leu Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro
75 80 85












atc cag agg gtc ccc aga gac atc cct gcc atg ctc cct gct gct cgg 578
Ile Gln Arg Val Pro Arg Asp Ile Pro Ala Met Leu Pro Ala Ala Arg
90 95 100 105




ctt ccc acc acc gtc ctc aac gcc aca gcc aaa gct gtt gcg gtg acc 626
Leu Pro Thr Thr Val Leu Asn Ala Thr Ala Lys Ala Val Ala Val Thr
110 115 120



ctg cag tca cac tgaccccacc tgaaattctt ggccagtcct ctttcccgca 678
Leu Gln Ser His
125

gctgcagaga ggargaasac tattaaagga cagtcctgat gacatgtttc gtagatgggg 738
tttgcagctg ccactgagct gtagctgcgt aagtacctcc ttgatgcctg tcggcacttc 798
tgaaaggcac aaggccaaga actcctggcc aggactgcaa ggctctgcag ccaatgcaga 858
aaatgggtca gctcctttga gaacccctcc ccacctaccc cttccttcct ctttatctct 918
cccacattgt cttgctaaat atagacttgg taattaaaat gttgattgaa gtctggaaaa 978
aaaaaaat 986



<210> 351
<211> 1447
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 282..389

<221> sig_peptide
<222> 282..332
<223> Von Heijne matrix
score 3.5
seq RWWCFHLQAEASA/HP

<221> polyA_signal
<222> 1413..1418

<221> polyA_site
<222> 1437..1447

<400> 351

ataataatat ctaaaaagct aaattttaaa taccagcttt acataaatga ttgtkgactc 60 

tggtctgtkt ctgacacctt tccagaaaaa agtcaattgt tcaggtacac caaagaggaa 120 

gaagagctgt ggaggccacc ctctacaaag ctttatagaa cttctggatc taactcacaa 180 

r^j^caagcttcc agaagagact agagacctta ggccaggaga tgaaggagtt cagtagcaaa 240 

^I'^^tcacacctg tccaattccc tgagctttgc tcactcagct a atg gga tgg caa agg 296 

Met Gly Trp Gln Arg

tgg tgg tgc ttt cat ctt cag gca gaa gcc tct gcc cat ccc cct caa 344
Trp Trp Cys Phe His Leu Gln Ala Glu Ala Ser Ala His Pro Pro Gln
-10 -5 1

ggg ctg cag gcc caa ttc tca tgc tgc cct tgg gtg ggc atc tgt 389
Gly Leu Gln Ala Gln Phe Ser Cys Cys Pro Trp Val Gly Ile Cys
5 10 15

taacaaadga aaacgtctgg gtggcggcag casctttgct ctgagtgcct acaaagctaa 449

Ipgcttggtgc tagaaacatc atcattatta aacttcagaa aagcagcagc catgttcagt 509

caggctcatg ctgcctcact gcttaagtgc ctgcaggagc cgcctgccaa rctccccttc 569

ptaeacctgg cacactgggg tetgcacaag gctttgtcaa ecaaaracag cttcccccww 629 

Jtgattgect gtagactttg gagccaaraa acactctgtg tgactctaca cacacttcag 689 

4|^tggtttgtg cttcaaagtc attgatgcaa cttgaaagga aacagtttaa tggtggaaat 749 

gaactaccat ttataacttc tgttttttta ttgagaaaat gattcacgaa kkccaaatca 809

gattgccagg aagaaatagg acgtgacggt actgggccct gtgattctcc cagcccttgc 869

agtccgctag gtgagaggaa aagctcttta cttccgcccc tggcagggac ttctgggtta 929 

tgggagaaac cagagatggg aatgaggaaa atatgaacta cagcagaagc ccctgggcag 989 

ctgtgatgga gcccctgaca ttactcttct tgcatctgtc ctgccttctt tccctctgcg 1049 

aggcagtggg gtgggattca gagtgcttag tctgctcact gggagaagaa gagttcctgc 1109 

gcatgcaagc cctgctgtgt ggctgtcgtt tacatttggg aggtgtcctg tatgtctgta 1169

cgttggggac tgcctgtatt tggaagattt aaaaacctag catcctgttc tcaccctcta 1229

agctgcattg agaaatgact cgtctctgta tttgtattaa gccttaacac ttttcttaag 1289 

tgcattcggt gccaacattt tttagagctg taccaaaaca aaaagcctgt actcacatca 1349 

camtgtcatt ttgataggag cgttttgtta tttttacaag gcagaatggg gtgtaacagt 1409 

tgaattaaac ttagcaatca cgtgctcaaa aaaaaaaa 1447



<210> 352
<211> 1641
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 208..339

<221> sig_peptide
<222> 208..294
<223> Von Heijne matrix
score 5.59999990463257
seq LFLQLLVSHEIVC/AT

<221> polyA_site
<222> 1631..1641

<400> 352

agaaccgtga tgggaagatg gacaaggaag agaccaaaga ctggatcctt ccctcagact 60 
atgatcatgc agaggcagaa gccaggcacc tggtctatga atcagaccaa aacaaggatg 120 
gcaagcttac caaggaggag atcgttgaca agtatgactt atttgttggc agccaggcca 180 
cagattttgg ggaggcctta gtacggc atg atg agt tct gag cta cgg agg aac 234
Met Met Ser Ser Glu Leu Arg Arg Asn
-25

cct cat ttc ctc aaa agt aat tta ttt tta cag ctt ctg gtt tca cat 282
Pro His Phe Leu Lys Ser Asn Leu Phe Leu Gln Leu Leu Val Ser His
-20 -15 -10 -5

gaa att gtt tgc gct act gag act gtt act aca aac ttt tta aga cat 330
Glu Ile Val Cys Ala Thr Glu Thr Val Thr Thr Asn Phe Leu Arg His
1 5 10

gaa aag gcg taatgaaaac catcccgtcc ccattcctcc tcctctctga 379
Glu Lys Ala

J^ggactggag ggaagccgtg cttctgagga acaaetctaa ttagtacaet tgtgtttgta 439 

; ratttacacw wtgtattatg tattaacatg gcgtgtttat ttttgtattt ttctctggtt 499 

.s^^ggagtatka tatgaaggat caaratccte aaetcacaea tgtaracaaa cattasetet 559 

;r^|:tactctttc teaaccectt wtatgatttt aataattetc aettaaetaa ttttgtaage 619 

■ptgagatcaa taagaaatgt teaggagaga ggaaagaaaa aaaatatatg etceaeaatt 679 

■^^atatttaga gagagaacac ttagtettge ctgteaaaaa gteeaacatt teataggtag 739 

'jtaggggecac atattacatt cagttgctat aggtceagca actgaacctg ecattacctg 799 

iggcaaggaaa gatcectttg ctetaggaaa gettggceca aattgatttt ettettttte 859 

;:'fcccctgtagg aetgaetgtt ggctaatttt gteaagcaca gctgtggtgg gaagagttag 919 

ggccagtgtc ttgaaaatea atcaagtagt gaatgtgatc tetttgeara gctatagata 979 

gaaacagctg gaaaactaaa ggaaaaatac aagtgttttc ggggeataca ttttttttet 1039 

gggtgtgcat ctgttgaaat gctcaagact taattatttg ecttttgaaa tcaetgtaaa 1099 

tgcceecate cggtteetct tcttccearg tgtgecaagg aattaatctt ggtttcacta 1159 

caattaaaat tcactecttt ccaatcatgt cattgaaagt gectttaacg aaagaaatgg 1219 

tcactgaatg ggaattctct taagaaaecc tgagattaaa aaaagaetat ttggataact 12 79 

tataggaaag ectagaaect eccagtagag tggggatttt tttettctte cetttctett 1339 

ttggacaata gttaaattag cagtattagt tatgagtttg gttgcagtgt tettatettg 1399 

tgggctgatt tccaaaaaec acatgetget gaatttaeca gggatectca tacetcaeaa 1459 

tgeaaaecac ttaetaceag gectttttet gtgtccactg gagagettga gctcacaetc 1519 

aaagatcaga ggaeetacag agagggetct ttggtttgag gaecatggct tacctttcct 1579 

gectttgaee catcacacce catttcetcc tctttccetc tecccgetgc caaaaaaaaa 1639 

1641 



<210> 353
<211> 884
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 69..557

<221> sig_peptide
<222> 69..224
<223> Von Heijne matrix
score 4.69999980926514
seq LGLALGRLEGGSA/RH

<221> polyA_signal
<222> 849..854

<221> polyA_site
<222> 870..883

<400> 353

attggctccg gatcgtgcgt gaggcggctt cgtgggcagc gagagtcaca gacaagacag 60 
caagcagg atg gag cac tac cgg aaa gct ggc tct gta gag ctc cca gcg 110
Met Glu His Tyr Arg Lys Ala Gly Ser Val Glu Leu Pro Ala
-50 -45 -40

cct tcc cca atg ccc cag cta cct cct gat acc ctt gag atg cgg gtc 158
Pro Ser Pro Met Pro Gln Leu Pro Pro Asp Thr Leu Glu Met Arg Val
-35 -30 -25

ggc agc aaa att cgc aac ctg ctg ggg ttg gct ctg ggt cgg 206
Arg Asp Gly Ser Lys Ile Arg Asn Leu Leu Gly Leu Ala Leu Gly Arg
-20 -15 -10

IJlhtg gag ggc ggc agt gct cgg cat gta gtg ttc tca ggt tct ggc agg 254
Leu Glu Gly Gly Ser Ala Arg His Val Val Phe Ser Gly Ser Gly Arg
-5 1 5 10

gct gca gga aag gct gtc agc tgc gct gag att gtc aag cgg cgg gtc 302
Ala Ala Gly Lys Ala Val Ser Cys Ala Glu Ile Val Lys Arg Arg Val
15 20 25

'iP^9 ggc ctg cac cag ctc acc aag cta ckt ttc ctt caa act gag gac 350
Pro Gly Leu His Gln Leu Thr Lys Leu Xaa Phe Leu Gln Thr Glu Asp
30 35 40

agc tgg gtc cca scc tca cct gac aca ggg cta rac ccc ctc aca gtg 398
Ser Trp Val Pro Xaa Ser Pro Asp Thr Gly Leu Xaa Pro Leu Thr Val
45 50 55

cgc cgc cat gtg cct gca ktg tgg gtg ctg ctc asc cgg gac ccc ctg 446
Arg Arg His Val Pro Ala Xaa Trp Val Leu Leu Xaa Arg Asp Pro Leu
60 65 70

gac ccc aat gag tgt ggt tac caa ccc cca gga gca ccc cct ggc ctg 494
Asp Pro Asn Glu Cys Gly Tyr Gln Pro Pro Gly Ala Pro Pro Gly Leu
75 80 85 90

ggt tcc atg ccc agc tcc agc tgt ggc cct cgt tcc cra aaa agg gct 542
Gly Ser Met Pro Ser Ser Ser Cys Gly Pro Arg Ser Xaa Lys Arg Ala
95 100 105

cra rac acc cga tcg tgaaaacctg ctgasccagc ctgttctccg ggcctraatg 597
Xaa Xaa Thr Arg Ser
110

tctggggtgc ttgtgccttt tctranaagc gttgtgaskg ctcaacatcc ccatcaaggt 657
ttgagtccac aaaagtggac ctccctatca tgcttcccct tccctctagc atgtgggaag 717
ggactgctgt gaagaatgac agatgtgggg cctctgccaa gttctgcatt gctaaataag 777
ggcttcctct gccttctacc tacagtgcat ttgaactgcc ttctgaaaga ggtccakgga 837
gggatttagg aaataaagtt tctacctatt tgaaaaaaaa aaaacac 884



<210> 354 
<211> 729 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 134..325

<221> sig_peptide
<222> 134..274
<223> Von Heijne matrix
      score 5.90000009536743
      seq TWLGLLSFQNLHC/FP

<221> polyA_site
<222> 718..729

<400> 354

J;s4tcattttct tatccctgct gatttcaaac cttcccatgg tttagaagca taacctgtaa 60 
agtaatgcaa gtcccctaac tccctggttg ctaacattaa cttccttaag taataatcaa 120
tgaaagavat tct atg cat ggt ttt gaa ata ata tcc ttg aaa gag gaa 169
               Met His Gly Phe Glu Ile Ile Ser Leu Lys Glu Glu
               -45 -40

tca cca tta gga aag gtg agt cag ggt cct ttg ttt aat gtg act agt 217
Ser Pro Leu Gly Lys Val Ser Gln Gly Pro Leu Phe Asn Val Thr Ser
-35 -30 -25 -20

ggc tca tca tca cca gtg acc tgg ttg ggc cta ctc tcc ttc cag aac 265
Gly Ser Ser Ser Pro Val Thr Trp Leu Gly Leu Leu Ser Phe Gln Asn
-15 -10 -5

["ptg cat tgc ttc cca gac ctc ccc act gag atg cct cta ara gcc aaa 313
Leu His Cys Phe Pro Asp Leu Pro Thr Glu Met Pro Leu Xaa Ala Lys
1 5 10

gga ktc aac act tgagcctagg gtgggctaca acaaaaratt ctaatttacc 365
Gly Xaa Asn Thr
15

ttgcttcatc taggtccagg ccccaaktag cttgctgaag gaacttaaaa agtagctgtt 425
atttattgta ttgtataasc taaaaacatt tatttttgtt gaatcraaac aattccatgt 485
ascaatcttt tttctgttca cggtgtttgt gataaaacct taaattccgc aagcatcagt 545
tttttgaaaa aatgggaatt gaccggatag wwacaggcaa agwtataaat agctacaaca 605
tcatttaact tttataaaca tgccttctct ctattgaara catctgatat ttttgctgga 665
aagttggatc tatcctcagt aactctgcca tgaattcctg tttcckggtt ccaaaaaaaa 725
aaaa 729



<210> 355
<211> 1013
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 78..731

<221> sig_peptide
<222> 78..227
<223> Von Heijne matrix
      score 5.09999990463257
      seq RTALILAVCCGSA/SI

<221> polyA_site
<222> 1002..1013

<400> 355 

agtttccaag ggaaggagca gcgtgtggga aagcacagaa gagtgagaag gaagcgacta 60 
aattttattt actttct atg cat cat ggc ctc aca cca ctg tta ctt ggt 110
                   Met His His Gly Leu Thr Pro Leu Leu Leu Gly
                   -50 -45 -40

gta cat gag caa aaa cag caa gtg gtg aaa ttt tta atc aag aaa aaa 158
Val His Glu Gln Lys Gln Gln Val Val Lys Phe Leu Ile Lys Lys Lys
-35 -30 -25

gca aat tta aat gca ctg gat aga tat gga aga act gct ctc ata ctt 206
Ala Asn Leu Asn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu Ile Leu
-10

gct gta tgt tgt gga tcg gca agt ata gtc agc ctt cta ctt gag caa 254
Ala Val Cys Cys Gly Ser Ala Ser Ile Val Ser Leu Leu Leu Glu Gln
-5 1 5

Hi?^^ cta tct gga cag acg gcc aaa aag 302
Asn Ile Asp Val Ser Ser Gln Asp Leu Ser Gly Gln Thr Ala Lys Lys
10 15 20 25

tat gct gtt tct agt cgt cat aat gta att tgc cag tta ctt tct gac 350
Tyr Ala Val Ser Ser Arg His Asn Val Ile Cys Gln Leu Leu Ser Asp
30 35 40

tac aaa raa aaa cag atr cta aaa gtc tct tct gaa aac agc aat cca 398
Tyr Lys Xaa Lys Gln Xaa Leu Lys Val Ser Ser Glu Asn Ser Asn Pro
50 55

!;/F^^ 9^^ 9^9 tca caa agg ctt aaa 446
Xaa Gln Asp Leu Lys Leu Thr Ser Glu Glu Glu Ser Gln Arg Leu Lys

gga agt gaa aat agc cag cca gag gaa atg tct caa gaa cca gaa ata 494
Gly Ser Glu Asn Ser Gln Pro Glu Glu Met Ser Gln Glu Pro Glu Ile
75 80 85

aat arg ggt ggt gat aga aag gtt gaa raa raa atg aar aag cac gga 542
Asn Xaa Gly Gly Asp Arg Lys Val Glu Xaa Xaa Met Lys Lys His Gly
90 95 100 105

agt wct cat atg gga ttc cca raa aac ctg mct aac ggt gcc act gct 590
Ser Xaa His Met Gly Phe Pro Xaa Asn Leu Xaa Asn Gly Ala Thr Ala
110 115 120

gac aat ggt gat gat gga tta att ccm cca rgg aaa asc ara aca cct 638
Asp Asn Gly Asp Asp Gly Leu Ile Pro Pro Xaa Lys Xaa Xaa Thr Pro
125 130 135

gaa agc cas caa ttt cct gac act gag aat gaa cag tat cac agg gac 686
Glu Ser Xaa Gln Phe Pro Asp Thr Glu Asn Glu Gln Tyr His Arg Asp
140 145 150

ttt tct ggc cat ccc mac ttt ccc acd acc ctt ccc atc aaa cag 731
Phe Ser Gly His Pro Xaa Phe Pro Thr Thr Leu Pro Ile Lys Gln
155 160 165

tgatgaacaa aatgatactc hsaagcmmct ttctgaagam caraacactg gaatattaca 791
agatgagatt ctgattcatg aagaaaagca gatagaagtg gctgaaaatg aattctgagc 851
tttctcttag ttataaraaa gaaaaagacc tcttgcatga aaatagtacg ttgcaggaag 911

aaattgtcat gctaaractg gaactagack taatgaaaca tcagagccag ctaararaaa 971 

araaatattt ggaggaaatt gaaagtgtgg aaaaaaaaaa aa 1013 



<210> 356 
<211> 973 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 46..693

<221> sig_peptide
<222> 46..90
<223> Von Heijne matrix
      score 7.59999990463257
      seq CVLVLAAAAGAVA/VF

<221> polyA_signal
<222> 937..942

<221> polyA_site
<222> 962..973

<400> 356

y,kagcggctgg tccccggaag ttggacgcat gcgccgtttc tctgc atg gtg tgc gtt 57 
                                                    Met Val Cys Val
                                                    -15

    gtt cta gct gcg gcc gca gga gct gtg gcg gtt ttc cta atc ctg 105
Leu Val Leu Ala Ala Ala Ala Gly Ala Val Ala Val Phe Leu Ile Leu
-10 -5 1 5

pga ata tgg gta gtg ctt cgt tcc atg gac gtt acg ccc cgg gag tct 153
    Ile Trp Val Val Leu Arg Ser Met Asp Val Thr Pro Arg Glu Ser
10 15 20

ctc agt atc ttg gta gtg gct ggg tcc ggt ggg cat acc act gag atc 201
Leu Ser Ile Leu Val Val Ala Gly Ser Gly Gly His Thr Thr Glu Ile
25 30 35

ctg agg ctg ctt ggg agc ttg tcc aat gcc tac tca cct aga cat tat 249
Leu Arg Leu Leu Gly Ser Leu Ser Asn Ala Tyr Ser Pro Arg His Tyr
40 45 50

gtc att gct gac act gat gaa atg agt gcc aat aaa ata aat tct ttt 297
Val Ile Ala Asp Thr Asp Glu Met Ser Ala Asn Lys Ile Asn Ser Phe
55 60 65

gaa cta rat cga gsk gat aga rac cct agt aac atg twt acc aaa tac 345
Glu Leu Xaa Arg Xaa Asp Arg Xaa Pro Ser Asn Met Xaa Thr Lys Tyr
70 75 80 85

tac att cac cga att cca ara agc cgg gag gtt cag cag tcc tgg ccc 393
Tyr Ile His Arg Ile Pro Xaa Ser Arg Glu Val Gln Gln Ser Trp Pro
90 95 100

tcc acc gtt tyc acc acc ttg cac tcc atg tgg ctc tcc ttk ccc cta 441
Ser Thr Val Xaa Thr Thr Leu His Ser Met Trp Leu Ser Xaa Pro Leu
105 110 115

att cac agg gtg aag cca rat ttg gtg ttg tgt aac gga cca gga aca 489
Ile His Arg Val Lys Pro Xaa Leu Val Leu Cys Asn Gly Pro Gly Thr
120 125 130

tgt gty cct atc tgt gta tct gcc ctt ctc ctt ggg ata cta gga ata 537
Cys Val Pro Ile Cys Val Ser Ala Leu Leu Leu Gly Ile Leu Gly Ile
135 140 145

aag aaa gtg atc att gtc tac gtt gaa agc atc tgc cgt gta aaa acs 585
Lys Lys Val Ile Ile Val Tyr Val Glu Ser Ile Cys Arg Val Lys Thr
150 155 160 165

tta tcc atg tcc gga aag att ctg ttt cat ctc tca aat tac ttc att 633
Leu Ser Met Ser Gly Lys Ile Leu Phe His Leu Ser Asn Tyr Phe Ile
170 175 180

gtt cag tgg ccg gct ctg aaa gaa aag tat ccc aaa tcg gtg tac ctt 681
Val Gln Trp Pro Ala Leu Lys Glu Lys Tyr Pro Lys Ser Val Tyr Leu
185 190 195

ggg cga att gtt tgacaaatgg caactgactt ctttagaatt ttgcasttaa 733
Gly Arg Ile Val
200

cagtartatg tactcaaatt ggggggaaaa aaaccctaca tgtttcttgt aaaggcgtct 793
gacagtcctg araattattg atggtaagga ataaaaaatg twcagatrac tcagtgaara 853
aactgaggct tctcttatga aacaaacatt gataaacgta actacyaaat gtttatgcct 913
ctgtaaacca aatttctttt ctarataaaa atatgtatta ctacctgcaa aaaaaaaaaa 973



<210> 357
<211> 868
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 126..527

<221> sig_peptide
<222> 126..182
<223> Von Heijne matrix
      score 3.90000009536743
      seq ILFHGVFYAGGFA/IV

<221> polyA_signal
<222> 834..839

<221> polyA_site
<222> 856..867

<400> 357 

actggaagaa ctcgtcatgc tctttgtagc gtggtgcttc tgttgctcac aggacaactt 60
gcctttgatg attttcaaga gagttgtgct atgatgtggc aaagtatgca ggaagcaggc 120
ggtca atg cct ctg gga gca agg atc ctt ttc cac ggt gtg ttc tat gcc 170
      Met Pro Leu Gly Ala Arg Ile Leu Phe His Gly Val Phe Tyr Ala
      -15 -10 -5

ggg ggc ttt gcc att gtg tat tac ctc att caa aag ttt cat tcc agg 218
Gly Gly Phe Ala Ile Val Tyr Tyr Leu Ile Gln Lys Phe His Ser Arg
1 5 10

act tta tat tac aag ttg gca gtg gar cag ctg car arc cat ccc gag 266
Thr Leu Tyr Tyr Lys Leu Ala Val Glu Gln Leu Gln Xaa His Pro Glu
15 20 25

gca cag gaa gct ctg ggc cct cct ctc aac atc cat tat ctc aag ctc 314
Ala Gln Glu Ala Leu Gly Pro Pro Leu Asn Ile His Tyr Leu Lys Leu
30 35 40

gac gaa aac -H 4- <^ gtg gac att gtt rat gcc aag ttg aaa att 362
Asp Arg Glu Asn Phe Val Asp Ile Val Xaa Ala Lys Leu Lys Ile
55 60

L- y gga tcc aaa cca gag ggc ctt ctc tac gtc cac tca tcc 410
Pro Val Gly Ser Lys Pro Glu Gly Leu Leu Tyr Val His Ser Ser
65 70 75

aga ggt ggc ccc ttt cag agg tgg cac ctt gac gag gtc ttt tta gag 458
Arg Gly Gly Pro Phe Gln Arg Trp His Leu Asp Glu Val Phe Leu Glu
80 85 90

ctc aag gat ggt cag cag att cct gtg ttc aag ctc agt ggg gaa aac 506
Leu Lys Asp Gly Gln Gln Ile Pro Val Phe Lys Leu Ser Gly Glu Asn
95 100 105

ggt gat gaa gtg aaa aag gag tagagacgac ccagaagacc cagcttgctt 557
Gly Asp Glu Val Lys Lys Glu
110 115

ctagtccatc cttccctcat ctctaccata tggccactgg ggtggtggcc catctcagtg 617
acagacactc ctgcaaccca gktttccagc caccagtggg atgatggtat gtgccagcac 677
atggtaattt tggtgtaatt ctaacttggg cacaacgaat gctatttgtc atttttaaac 737
tgaatccgaa agaaactcct attataaatt taagataatg taatgtattt gaaagtgctt 797
tgtataaaaa agcacatgat aaaaggaatc agaattaata aaatgtttgt tgatctttaa 857
'"JJIaaaaaaaaaa 868



<210> 358
<211> 519
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 66..320

<221> sig_peptide
<222> 66..113
<223> Von Heijne matrix
      score 3.5
      seq TALAAXTWLGVWG/VR

<221> polyA_signal
<222> 490..495

<221> polyA_site
<222> 508..519



<400> 358 

aattagcgcg taacgcasag actgcttgct gcggcagaga cgccagakgt gcagctccag 60
cagca atg gca gtg acg gcg ttg gcg gcg mrg acg tgg ctt ggc gtg tgg 110
      Met Ala Val Thr Ala Leu Ala Ala Xaa Thr Trp Leu Gly Val Trp
      -15 -10 -5

ggc gtg agg acc atg caa gcc cga ggc ttc ggc tcg gat cag tcc gag 158
Gly Val Arg Thr Met Gln Ala Arg Gly Phe Gly Ser Asp Gln Ser Glu
1 5 10 15

aat gtc gac cgg ggc gcg ggc tcc atc cgg gaa gcc ggt ggg gcc ttc 206
Asn Val Asp Arg Gly Ala Gly Ser Ile Arg Glu Ala Gly Gly Ala Phe
20 25 30

gga aag aga gag cag gct gaa gag gaa cga tat ttc cga gca cag agt 254
Gly Lys Arg Glu Gln Ala Glu Glu Glu Arg Tyr Phe Arg Ala Gln Ser
35 40 45

aca gaa caa ctg gca rct ttg aaa aaa crc cat gaa gaa gar atc gtt 302
Thr Glu Gln Leu Ala Xaa Leu Lys Lys Xaa His Glu Glu Glu Ile Val

50 55 60 

cat cat aga gaa gga gat tgagcgtctg cagaaagaaa ttgagcgcca 350 
His His Arg Glu Gly Asp 
65 

taagcagaag atcaaaatgc tagaacatga tgattaagtg cacaccgtgt gccatagaat 410 
ggcacatgtc attgcccact tctgtgtaaa catggttctg gtttaactaa tatttgtctg 470 
tgtgctacta acagattata ataaattgtc atcagtgaaa aaaaaaaaa 519 



<210> 359
<211> 1028
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 73..948

<221> sig_peptide
<222> 73..159
<223> Von Heijne matrix
      score 4.40000009536743
      seq IVLHLVLQGMVYT/EY

<221> polyA_site
<222> 1016..1028

<400> 359

.J^gctttaaag gcctggccag gggaggagca cagatatttt cctgtataat tccagaatgt 60 
Ij^pttcagagag cc atg cat gga ttg ctt cat tac ctt ttc cat acg aga aac 111 
                 Met His Gly Leu Leu His Tyr Leu Phe His Thr Arg Asn
                 -25 -20

cac acc ttc att gtc ctg cac ctg gtc ttg caa ggg atg gtt tat act 159
His Thr Phe Ile Val Leu His Leu Val Leu Gln Gly Met Val Tyr Thr
-15 -10 -5

gag tac acc tgg gaa gta ttt ggc tac tgt cag gag ctg gag ttg tcc 207
Glu Tyr Thr Trp Glu Val Phe Gly Tyr Cys Gln Glu Leu Glu Leu Ser
1 5 10 15

ttg cat tac ctt ctt ctg ccc tat ctg ctg cta ggt gta aac ctg ttt 255
Leu His Tyr Leu Leu Leu Pro Tyr Leu Leu Leu Gly Val Asn Leu Phe
20 25 30

ttt ttc acc ctg act tgt gga acc aat cct ggc att ata aca aaa gca 303
Phe Phe Thr Leu Thr Cys Gly Thr Asn Pro Gly Ile Ile Thr Lys Ala
35 40 45

aat gaa tta tta ttt ctt cat gtt tat gaa ttt gat gaa ktg atg ttt 351
Asn Glu Leu Leu Phe Leu His Val Tyr Glu Phe Asp Glu Xaa Met Phe
50 55 60

cca aaa aac gtg agg tgc tct act tgt gat tta agg aaa cca gct cga 399
Pro Lys Asn Val Arg Cys Ser Thr Cys Asp Leu Arg Lys Pro Ala Arg
65 70 75 80

tcc aas cac tgc akt gtg tgt aac tgg tgt gtg cac cgt ttc rac cat 447
Ser Xaa His Cys Xaa Val Cys Asn Trp Cys Val His Arg Phe Xaa His
85 90 95

Cd.C gzz tgg gtg aac aac tgc atc ggg gcc tgg aac atc agg tmc 495
XT-i i-M Cys Val Trp Val Asn Asn Cys Ile Gly Ala Trp Asn Ile Arg Xaa
100 105 110

uCC ctc atc tac gtc ttg acc ttg acg gcc tcg gct gcc acc gtc gcc 543
    Leu Ile Tyr Val Leu Thr Leu Thr Ala Ser Ala Ala Thr Val Ala
115 120 125

att gtg agc acc act ttt ctg gtc cac ttg gtg gtg atg tca gat tta 591
Ile Val Ser Thr Thr Phe Leu Val His Leu Val Val Met Ser Asp Leu
130 135 140

tac cag gag act tac atc gat gac ctt gga cac ctc cat gtt atg gac 639
Tyr Gln Glu Thr Tyr Ile Asp Asp Leu Gly His Leu His Val Met Asp
145 150 155 160

acg gtc ttt ctt att cag tac ctg ttc ctg act ttt cca cgg att gtc 687
Thr Val Phe Leu Ile Gln Tyr Leu Phe Leu Thr Phe Pro Arg Ile Val
165 170 175

ttc atg ctg ggc ttt gtc gtg gtt ctg arc ttc ctc ctg ggt ggc tac 735
Phe Met Leu Gly Phe Val Val Val Leu Xaa Phe Leu Leu Gly Gly Tyr
180 185 190

r'Ptg ttg ttt gtc ctg tat ctg gcg gcc acc aac cag act act aac gag 783
    Leu Phe Val Leu Tyr Leu Ala Ala Thr Asn Gln Thr Thr Asn Glu
195 200 205

tgg tac aga rgt gac tgg gcc tgg tgc cag cgt tgt ccc ctt gtg gcc 831
Trp Tyr Arg Xaa Asp Trp Ala Trp Cys Gln Arg Cys Pro Leu Val Ala
210 215 220

    cct ccg tca gca gar ccc caa gtc cac cgg aac att cac tcc cat 879
    Pro Pro Ser Ala Glu Pro Gln Val His Arg Asn Ile His Ser His
225 230 235 240

    ctt cgg arc aac ctt caa gar atc ttt cta cct gcc ttt cca tgt 927
    Leu Arg Xaa Asn Leu Gln Glu Ile Phe Leu Pro Ala Phe Pro Cys
245 250 255

rilbat gag agg aag aaa caa gaa tgacmagtgt atgactgcct ttgagctgta 978
    Glu Arg Lys Lys Gln Glu
260

gttcccgttt atttacacat gtggatcctc gttttccaaa aaaaaaaaaa 1028



<210> 360 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 69..434

<221> sig_peptide
<222> 69..236
<223> Von Heijne matrix
      score 4.90000009536743
      seq FACVPGASPTTLA/FP

<221> polyA_signal
<222> 419..424

<221> polyA_site
<222> 441..452

<400> 360 

acagcgtgas tcgcccgcca gaagaatatg aaaaagcaga gcganctcgg ttaagggaaa 60 
gcgccgag atg acg ggc ttt ctg ctg ccg ccc gca agc aga ggg act cgg 110
         Met Thr Gly Phe Leu Leu Pro Pro Ala Ser Arg Gly Thr Arg
         -55 -50 -45

aga tca tgc agc aga agc aga aaa agg caa acg aga aga agg agg aac 158
Arg Ser Cys Ser Arg Ser Arg Lys Arg Gln Thr Arg Arg Arg Arg Asn
-40 -35 -30

cca agt agc ttt gtg gct tcg tgt cca acc ctc ttg ccc ttc gcc tgt 206
Pro Ser Ser Phe Val Ala Ser Cys Pro Thr Leu Leu Pro Phe Ala Cys
-25 -20 -15

gtg cct gga gcc agt ccc acc acg ctc gcg ttt cct cct gta ktg ctc 254
Val Pro Gly Ala Ser Pro Thr Thr Leu Ala Phe Pro Pro Val Xaa Leu
-10 -5 1 5

aca ggt ccc avc acc gat ggc att ccc ttt gcc ctr nak tct gca gcg 302
Thr Gly Pro Xaa Thr Asp Gly Ile Pro Phe Ala Leu Xaa Ser Ala Ala
10 15 20

:pgt ccc ttt tgt gct tcc ttc ccc tca ggt avc ctc tct ccc cct ggg 350
    Pro Phe Cys Ala Ser Phe Pro Ser Gly Xaa Leu Ser Pro Pro Gly
25 30 35

cca ctc ccg ggg gtg agg ggg tta ccc ctt ccc agt gtt ttt tat tcc 398
Pro Leu Pro Gly Val Arg Gly Leu Pro Leu Pro Ser Val Phe Tyr Ser
40 45 50

    ggg gct cac ccc aaa gta tta aaa gta gct ttg taattcaaaa 444
Lys Gly Ala His Pro Lys Val Leu Lys Val Ala Leu
60 65

aaaaaaaa 452



<210> 361
<211> 875
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 628..804

<221> sig_peptide
<222> 628..711
<223> Von Heijne matrix
      score 4.19999980926514
      seq LMPVIPALQEAXA/GG

<221> polyA_site
<222> 864..875

<400> 361 

aaagatggac accgcggagg aagacatatg tagagtgtgt cggtcagaag gaacacctga 60
gaaaccgctt tatcatcctt gtgtatgtac tggcagtatt aagttngtcc atcaagaatg 120
cttagttcaa tggctgaaac acagtcgaaa agaatactgt gaattatgca agcacagatt 180
tgcttttaca ccaatttatt ctccagatat gccttcacgg cttccaattc aagacatatt 240
tgctggactg gttacaagta ttggcactgc aatacgatat tggtttcatt atacacttgt 300
ggcctttgca tggttgggag ttgttcctct tacagcatgt gagtattcat gcctctgatt 360
ggagttattt aaacattgca taactactta atattataaa gcaatattgc atcatattat 420
tatttgactg atgtttagtt atttgatgtc agagtgtcat gtattaggaa agccttactt 480
araaratgtt catcggaact aaraatgakt ttaacaggtc agttttttga gtgaatgtgg 540
gaaaraacac agcatacaga atggctaacc atgaaagttc atgaaagcgt kgaaaaaatc 600
aaatcaaatc ataattagat atgaagt atg cta rag ctt tca agg gct aca aaa 654
                              Met Leu Xaa Leu Ser Arg Ala Thr Lys
                              -25 -20

rac ggc cgg gcg cgg tgg ctt atg cct gta atc cca gca ctt cag gag 702
Xaa Gly Arg Ala Arg Trp Leu Met Pro Val Ile Pro Ala Leu Gln Glu
-15 -10 -5

gcc gan gca ggc gga tca cga ggt cag gag ttt gaa act agc ctg gcc 750
Ala Xaa Ala Gly Gly Ser Arg Gly Gln Glu Phe Glu Thr Ser Leu Ala
1 5 10

aac atg gag act gag gca gga gaa ttg ctt aaa ccc agg agg cgg agg 798
Asn Met Glu Thr Glu Ala Gly Glu Leu Leu Lys Pro Arg Arg Arg Arg
15 20 25

ttg car tgaactgaga tcgcaccact gcactccagc ttgggcaaca gagcaagact 854
Leu Gln
30

;,|:tgtctcgca aaaaaaaaaa a 875

<210> 362
<211> 531
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 70..366

<221> sig_peptide
<222> 70..108
<223> Von Heijne matrix
      score 3.5
      seq MHLLSNWANPASS/RR

<221> polyA_signal
<222> 496..501

<221> polyA_site
<222> 521..531

<400> 362 

aagtggccat ggcggataca gcgactacag catcggcggc ggcggctagt gccgctagcg 60
cctcgagcg atg cac ctc ctt tcc aac tgg gca aac ccc gct tcc agc aga 111
          Met His Leu Leu Ser Asn Trp Ala Asn Pro Ala Ser Ser Arg
          -10 -5 1

cgt cct tct atg gcc gct tca ggc act tct tgg ata tca tcg acc ctc 159
Arg Pro Ser Met Ala Ala Ser Gly Thr Ser Trp Ile Ser Ser Thr Leu
5 10 15

gca cac tct ttg tca ctg aga gac gtc tca gag agg ctg tgc agc tgc 207
Ala His Ser Leu Ser Leu Arg Asp Val Ser Glu Arg Leu Cys Ser Cys
20 25 30

tgg ^99 act ata agc atg gga ccc tgc gcc cgg ggg tca cca atg aac 255
Trp Arg Thr Ile Ser Met Gly Pro Cys Ala Arg Gly Ser Pro Met Asn
35 40 45

agc tct gga gtg cac aga aaa tca agc agg cta ttc tac atc cgg aca 303
Ser Ser Gly Val His Arg Lys Ser Ser Arg Leu Phe Tyr Ile Arg Thr
50 55 60 65

cca atg aga aga tct tca tgc cat tta gaa tgt crg gtt ata ttc ctt 351
Pro Met Arg Arg Ser Ser Cys His Leu Glu Cys Xaa Val Ile Phe Leu
70 75 80

ttg gga cgc caa ttg taaktgttac cttcaaagga tttccttttc taaaaaatta 406
Leu Gly Arg Gln Leu
85

ttttaratgt ctaactttat gttattgctc acgggtattt gactgaattg ttgatttagg 466
ataagtcaat tcctggaggg aaattaccaa ataaaatgat atgtatttct taccacaaaa 526

aaaaa 531 



<210> 363 
<211> 1244 
<212> DNA 
<213> Homo sapiens

<220>
<221> CDS
<222> 70..366

<221> sig_peptide
<222> 70..108
<223> Von Heijne matrix
      score 3.5
      seq MHLLSNWANPASS/RR

<221> polyA_site
<222> 1233..1244

<400> 363

0!!^agtggccat ggcggataca gcgactacag catcggcggc ggcggctagt gccgctagcg 60
cctcgagcg atg cac ctc ctt tcc aac tgg gca aac ccc gct tcc agc aga 111
          Met His Leu Leu Ser Asn Trp Ala Asn Pro Ala Ser Ser Arg
          -10 -5 1

cgt cct tct atg gcc gct tca ggc act tct tgg ata tca tcg acc ctc 159
Arg Pro Ser Met Ala Ala Ser Gly Thr Ser Trp Ile Ser Ser Thr Leu
5 10 15

gca cac tct ttg tca ctg aga gac gtc tca gag agg ctg tgc agc tgc 207
Ala His Ser Leu Ser Leu Arg Asp Val Ser Glu Arg Leu Cys Ser Cys
20 25 30

tgg agg act ata agc atg gga ccc tgc gcc cgg ggg tca cca atg aac 255
Trp Arg Thr Ile Ser Met Gly Pro Cys Ala Arg Gly Ser Pro Met Asn
35 40 45

agc tct gga gtg cac aga aaa tca agc agg cta ttc tac atc cgg aca 303
Ser Ser Gly Val His Arg Lys Ser Ser Arg Leu Phe Tyr Ile Arg Thr
50 55 60 65

cca atg aga aga tct tca tgc cat tta raa tgt cag gtt ata ttc ctt 351
Pro Met Arg Arg Ser Ser Cys His Leu Xaa Cys Gln Val Ile Phe Leu
70 75 80

ttg gga cgc caa ttg tagtcggtct tctcttgccc aaccagacac tggcatccac 406
Leu Gly Arg Gln Leu
85

tgtcttctgg cagtggctga accagagcca caatgcctgt gtcaactatg caaaccgcaa 
tgcraccaag ccttcacctg catccaagtt catccaggga tacctgggag ctgtcatcag 
cgccgtctcc attgctgtgg gccttatktc ctggttcaga aagccaacaa gttcacccca 
gccacccgcc ttctcatcca gaggtttgtg ccgttccctg ctgtagccag tgccaatatc 
tgcaatgtgg tcctgatgcg gtacggggag ctggaggaag ggattgatgt cctggacagc 
gatggcaacc tcgtgggctc ctccaagatc gcagcccgac acgccctgct ggagacggcg 
ctgacgcgag tggtcctgcc catgcccatc ctggtgctac ccccgatcgt catgtccatg 
ctggagaaga cggctctcct gcaggcacgc ccccggctgc tcctccctgt gcaaagcctc 
gtgtgcctgg cagccttcgg cctggccctg ccgctggcca tcagcctctt cccgcaaatg 
tcagagattg aaacatccca attagagccg gagatagccc aggccacgag cagccggaca 
gtggtgtaca acaaggggtt gtgagtgtgg tcagcggcct ggggacggag cactgtgcag 
ccggggagct gaggggcarg gccgtagact cacggctgca cctgcaggga gcagcacgcc 
aaccccagca gtcctgggcc ccctgggaga gtgctcaacc tacagtggag ggagactgac 
ccattcacat tttaacatag gcaagaggag ttctaacaca tttcgtacaa aaaaaaaa 



<210> 364 
<211> 631 

<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 111..434

<221> sig_peptide
<222> 111..185
<223> Von Heijne matrix
      score 3.90000009536743
      seq WIAAVTIAAGTAA/IG

<221> polyA_site
<222> 618..631

<400> 364

aatcgcggag tcggtgcttt agtacgccgc tggcaccttt actctcgccg gccgcgcgaa 60
cccgtttgag ctcggtatcc tagtgcacac gccttgcaag cgacggcgcc atg agt 116
                                                       Met Ser
                                                       -25

ctg act tcc agt tcc agc gta cga gtt gaa tgg atc gca gca gtt acc 164
Leu Thr Ser Ser Ser Ser Val Arg Val Glu Trp Ile Ala Ala Val Thr
-20 -15 -10

att gct gct ggg aca gct gca att ggt tat cta gct tac aaa aga ttt 212
Ile Ala Ala Gly Thr Ala Ala Ile Gly Tyr Leu Ala Tyr Lys Arg Phe
-5 1 5

tat gtt aaa gat cat cga aat aaa gct atg ata aac ctt cac atc cag 260
Tyr Val Lys Asp His Arg Asn Lys Ala Met Ile Asn Leu His Ile Gln
10 15 20 25

aaa gac aac ccc aag ata gta cat gct ttt gac atg gag gat ttg gga 308
Lys Asp Asn Pro Lys Ile Val His Ala Phe Asp Met Glu Asp Leu Gly
30 35 40

gat aaa gct gtg tac tgc cgt tgt tgg agg tcc aaa aag ttc cca ttc 356
Asp Lys Ala Val Tyr Cys Arg Cys Trp Arg Ser Lys Lys Phe Pro Phe
45 50 55

tgt gat ggg gct cac aca aaa cat aac gaa gag act gga gac aat gtg 404
Cys Asp Gly Ala His Thr Lys His Asn Glu Glu Thr Gly Asp Asn Val
60 65 70

ggc cct ctg atc atc aag aaa aaa gaa act taaatggaca cttttgatgc 454
Gly Pro Leu Ile Ile Lys Lys Lys Glu Thr
75 80

tgcaaatcag cttgtcgtga agttacctga ttgtttaatt araatgacta ccacctctgt 514
ctgattcacc ttcgctggat tctaaatgtg gtatattgcm aactgcagct ttcacattta 574
tggcatttgt cttgttgaaa catcgtggtg cacatttgtt taaacaaaaa aaaaaaa 631



<210> 365 
<211> 781 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 19..567

<221> sig_peptide
<222> 19..63
<223> Von Heijne matrix
      score 8.39999961853027
      seq AMWLLCVALAVLA/WG

<221> polyA_signal
<222> 749..754

<221> polyA_site
<222> 771..781

<400> 365

pagtgctgct tacccatc atg gaa gca atg tgg ctc ctg tgt gtg gcg ttg 51
                    Met Glu Ala Met Trp Leu Leu Cys Val Ala Leu
                    -15 -10 -5

gcg gtc ttg gca tgg ggc ttc ctc tgg gtt tgg gac tcc tca gaa cga 99
Ala Val Leu Ala Trp Gly Phe Leu Trp Val Trp Asp Ser Ser Glu Arg
1 5 10

atg aag agt cgg gag cag gga aga cgg ctg gga gcc gaa agc cgg acc 147
Met Lys Ser Arg Glu Gln Gly Arg Arg Leu Gly Ala Glu Ser Arg Thr
15 20 25

ctg ctg gtc ata gcg cac cct gac gat gaa gcc atg ttt ttt gct ccc 195
Leu Leu Val Ile Ala His Pro Asp Asp Glu Ala Met Phe Phe Ala Pro
30 35 40

aca gtg cta ggc ttg gcc cgc cta agg cac tgg gtg tac ctg ctt tgc 243
Thr Val Leu Gly Leu Ala Arg Leu Arg His Trp Val Tyr Leu Leu Cys
45 50 55 60

ttc tct gca gga aat tac tac aat caa gga gag act cgt aag aaa gaa 291
Phe Ser Ala Gly Asn Tyr Tyr Asn Gln Gly Glu Thr Arg Lys Lys Glu
65 70 75

ctt ttg car agc tgt gat gtt ttg ggg att cca ctc tcc agt gta atg 339
Leu Leu Gln Ser Cys Asp Val Leu Gly Ile Pro Leu Ser Ser Val Met
80 85 90

att att gac aac agg gat ttc cca rat gac cca ggc atg cag tgg gac 387
Ile Ile Asp Asn Arg Asp Phe Pro Xaa Asp Pro Gly Met Gln Trp Asp
95 100 105

aca rag cac gtg gcc ara gtc ctc ctt cag cac ata gaa gtg aat ggc 435
Thr Xaa His Val Ala Xaa Val Leu Leu Gln His Ile Glu Val Asn Gly
110 115 120

atc aat ctg gtg gtg act ttc gat gca ggg gga rta agt ggc cac agc 483
Ile Asn Leu Val Val Thr Phe Asp Ala Gly Gly Xaa Ser Gly His Ser
125 130 135 140

aat cac att gct ctg tat gca gct gtg agg aag ctt gag ggc caa att 531
Asn His Ile Ala Leu Tyr Ala Ala Val Arg Lys Leu Glu Gly Gln Ile
145 150 155

tgc aag ccc tgt ggc act gga caa gac ttt aag gaa tgagtgctgt 577
Cys Lys Pro Cys Gly Thr Gly Gln Asp Phe Lys Glu
160 165

caatcagtgt gcctccacct tcaccatctt cttcccctta ctctcacttc cgtcatgtgt 637
tttatacaac tctcaaatct ttcttggaga aggaggatat acatacataa tatgaaatgt 697
gtttgttctt cacagtcacc cgattttact gatatttatt tgcattttac caataaaaag 757
aaaatgcaag ctcaaaaaaa aaaa 781



<210> 366 
<211> 931
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 19..312

<221> sig_peptide
<222> 19..63
<223> Von Heijne matrix
score 8.39999961853027
seq AMWLLCVALAVLA/WG

<221> polyA_signal
<222> 896..901

<221> polyA_site
<222> 921..931

<400> 366 

aagtgctgct tacccatc atg gaa gca atg tgg ctc ctg tgt gtg gcg ttg 51
Met Glu Ala Met Trp Leu Leu Cys Val Ala Leu
-15 -10 -5

gcg gtc ttg gca tgg ggc ttc ctc tgg gtt tgg gac tcc tca gaa cga 99
Ala Val Leu Ala Trp Gly Phe Leu Trp Val Trp Asp Ser Ser Glu Arg
1 5 10

atg aag agt cgg gag cag gga rga cgg ctg gga gcc gaa agc cgg acc 147
Met Lys Ser Arg Glu Gln Gly Xaa Arg Leu Gly Ala Glu Ser Arg Thr
15 20 25

ctg ctg gtc ata gcg cac cct gac gat gaa gcc atg ttt ttt gct ccc 195
Leu Leu Val Ile Ala His Pro Asp Asp Glu Ala Met Phe Phe Ala Pro
30 35 40

aca gtg cta ggc ttg gcc cgc cta agg cac tgg gtg tac ctg ctt tgc 243
Thr Val Leu Gly Leu Ala Arg Leu Arg His Trp Val Tyr Leu Leu Cys
45 50 55 60

ttc tct gca gtt ttc cgt agg gag cta agt gaa tac acc gaa rgt ctt 291
Phe Ser Ala Val Phe Arg Arg Glu Leu Ser Glu Tyr Thr Glu Xaa Leu
65 70 75

acc tct gaa ccc ctc ama gcc tagggacagg arcggccggc ttacctggtg 342
Thr Ser Glu Pro Leu Xaa Ala
80

ggttggggga cgtcggcagc tcrcgtacta cgccagcagg attganganc acagaaacag 402
ttgchsttgg ttgtattcag tacctkcatt tccgttggga actccaccwg tacttgttat 462
kctgtggaac ttttttttat ttgtagaagg agcaagaata ttgaccttac tatatagcac 522
acgaaacaat ctatgctgta tcgtgcctgc tcaatcctta aagttaactt ctaatgatag 582
taaaaracct tcctgctgcc tttaaaatgc agcttgtgct aktaacatgc atgtgtcaaa 642
ttgaaraatt agacatagat gactaratar aaagtaattt tgtaggtaat tttaragttc 702
aactccaccc agctttcakt gaaggaacct ttcaaataat aratttttgc ttaccatara 762
raaaaratca aatgacaaag caaatattga ccattaagct ggaatatggt gataattgaa 822
cagttgtata aatgaaktaa ttgaattgta cacatacaat gggtgaattt tatggcatgt 882
caaagtatac ctcaataaag ctattttttt aaattgcmaa aaaaaaaaa 931



<210> 367 

<211> 849 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 64..612

<221> sig_peptide
<222> 64..234
<223> Von Heijne matrix
score 3.79999995231628
seq QLWLVMEFCGAGS/VT

<221> polyA_site
<222> 839..849

<400> 367

acatacgggc aagtttataa gggtcgtcat gtcaaaacgg gccagcttgc agccatcaag 60

gtt atg gat gtc aca ggg gat gaa gag gaa gaa atc aaa caa gaa att 108
Met Asp Val Thr Gly Asp Glu Glu Glu Glu Ile Lys Gln Glu Ile
-55 -50 -45

aac atg ttg aag aaa tat tct cat cac cgg aat att gct aca tac tat 156
Asn Met Leu Lys Lys Tyr Ser His His Arg Asn Ile Ala Thr Tyr Tyr
-40 -35 -30

ggt gct ttt atc aaa aag aac cca cca ggc atg gat gac caa ctt tgg 204
Gly Ala Phe Ile Lys Lys Asn Pro Pro Gly Met Asp Asp Gln Leu Trp
-25 -20 -15

ttg gtg atg gag ttt tgt ggt gct ggc tct gtc acc gac ctg atc aag 252
Leu Val Met Glu Phe Cys Gly Ala Gly Ser Val Thr Asp Leu Ile Lys
-10 -5 1 5

aac aca aaa ggt aac acg ttg aaa gag gag tgg att gca tac atc tgc 300
Asn Thr Lys Gly Asn Thr Leu Lys Glu Glu Trp Ile Ala Tyr Ile Cys
10 15 20

msg gaa atc tta cgg ggg ctg art cac ctg cac cag cat aaa gtg att 348
Xaa Glu Ile Leu Arg Gly Leu Xaa His Leu His Gln His Lys Val Ile
25 30 35

cat cga rat att aaa ggg caa aat gtc ttg ctg act gaa aat gca gaa 396
His Arg Xaa Ile Lys Gly Gln Asn Val Leu Leu Thr Glu Asn Ala Glu
40 45 50

gtt aaa cta gtg gac ttt gga rtc akt gct cag ctt gat cga aca gtg 444
Val Lys Leu Val Asp Phe Gly Xaa Xaa Ala Gln Leu Asp Arg Thr Val
55 60 65 70

ggc agg arg aat act ttc att gga act ccc tac tgg atg gca cca raa 492
Gly Arg Xaa Asn Thr Phe Ile Gly Thr Pro Tyr Trp Met Ala Pro Xaa
75 80 85

gtt att gcc tgt gat gaa aac cca sat gcc aca tat gat ttc aar art 540
Val Ile Ala Cys Asp Glu Asn Pro Xaa Ala Thr Tyr Asp Phe Lys Xaa
90 95 100

gac ttg tgg tct ttg ggt atc acc gcc att gaa atg gca gaa ggg ctc 588
Asp Leu Trp Ser Leu Gly Ile Thr Ala Ile Glu Met Ala Glu Gly Leu
105 110 115

ccc ctc tct gtg aca tgc acc cca tgagagctct cttcctcatc ccccggaatc 642
Pro Leu Ser Val Thr Cys Thr Pro
120 125

cagcgcctcg gctgaagtct aagaagtggt caaaaaaatt ccagtcattt attgagagct 702
gcttggtaaa aaatcacagc cagcgaccag caacagaaca attgatgaag catccattta 762
tacgagacca acctaatgag cgacaggtcc gcattcaact caaggaccat attgatagaa 822
paaagaagaa gcgaggaaaa aaaaaaa 849

<210> 368
<211> 644
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 39..458

<221> sig_peptide
<222> 39..80
<223> Von Heijne matrix
score 4.40000009536743
seq FLTALLWRGRIPG/RQ

<221> polyA_signal
<222> 613..618

<221> polyA_site
<222> 633..644

<400> 368

agcggagacg cagagtcttg agcagcgcgn caggcacc atg ttc ctg act gcg ctc 56
Met Phe Leu Thr Ala Leu
-10

ctc tgg cgc ggc cgc att ccc ggc cgt cag tgg atc ggg aag cac cgg 104
Leu Trp Arg Gly Arg Ile Pro Gly Arg Gln Trp Ile Gly Lys His Arg
-5 1 5

cgg ccg cgg ttc gtg tcg ttg cgc gcc aag cag aac atg atc cgc cgc 152
Arg Pro Arg Phe Val Ser Leu Arg Ala Lys Gln Asn Met Ile Arg Arg
10 15 20

ctg gag atc gag gcg gag aac cat tac tgg ctg agc atg ccc tac atg 200
Leu Glu Ile Glu Ala Glu Asn His Tyr Trp Leu Ser Met Pro Tyr Met
25 30 35 40

acc cgg gag cag gag cgc ggc cac gcc gcg ttg cgc agg agg gag gcc 248
Thr Arg Glu Gln Glu Arg Gly His Ala Ala Leu Arg Arg Arg Glu Ala
45 50 55

ttc gag gcc ata aag gcg gcc gcc act tcc aag ttc ccc ccg cat aga 296
Phe Glu Ala Ile Lys Ala Ala Ala Thr Ser Lys Phe Pro Pro His Arg
60 65 70

ttc att gcg gac cag ctc gac cat ctc aat vgt cac caa gaa atg gtc 344
Phe Ile Ala Asp Gln Leu Asp His Leu Asn Xaa His Gln Glu Met Val
75 80 85

cta atc ctg agt cgt cac cct tgg att tta tgg atc acg gag ctg acc 392
Leu Ile Leu Ser Arg His Pro Trp Ile Leu Trp Ile Thr Glu Leu Thr
90 95 100

atc ttt acc tgg tct gga ctg aaa aac tgt agc ttg tgt gaa aat gag 440
Ile Phe Thr Trp Ser Gly Leu Lys Asn Cys Ser Leu Cys Glu Asn Glu
105 110 115 120

ctt tgg acc agt ctt tat taaaacaaac aaacatgagt agtctgcata 488
Leu Trp Thr Ser Leu Tyr
125

tcgaatatct agagctctaa accccccaat acttaaaagt ctaattgctg tcctgtggtt 548
fcattagtct gataggaaga tagggatttc ctcagtcaca gatgatattt tgaaggaaag 608
jtgcaataaa gccacaatga tttgaaaaaa aaaaaa 644



<210> 369
<211> 918
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 9..185

<221> sig_peptide
<222> 9..50
<223> Von Heijne matrix
score 3.70000004768372
seq AALVTVLFTGVRR/LH

<221> polyA_site
<222> 906..918

<400> 369 

agctcagc atg gct gct tta gtg act gtt ctc ttc aca ggt gtc cgg agg 50
Met Ala Ala Leu Val Thr Val Leu Phe Thr Gly Val Arg Arg
-10 -5

ctg cac tgc agc gcr scg ctt ggg cgg gcg gcc agt ggc grc tac agc 98
Leu His Cys Ser Ala Xaa Leu Gly Arg Ala Ala Ser Gly Xaa Tyr Ser
1 5 10 15

agg aac tgg ctg cca acc cct ccg gct acg ggc ccc tta ccg agc tcc 146
Arg Asn Trp Leu Pro Thr Pro Pro Ala Thr Gly Pro Leu Pro Ser Ser
20 25 30

cag act ggt cat atg cgg atg gcc gcc ctg ctc ccc caa tgaaaggcca 195
Gln Thr Gly His Met Arg Met Ala Ala Leu Leu Pro Gln
35 40 45

gcttcgaaaa aaagctgaaa gggagacktt tgcaaracra kttgtactgc tgtcacagga 255

aatggacgct ggattacaas catggcasct caggcagcar aakttgcagg aaraacaaag 315 

gaagcaggaa aatgctctta aacccaaagg ggcttcactg aaaascccac ttccaaktca 375 

ataaaaagca actcctgcct cccttcctca ccctgtctct ggatttcttt tctatcacct 435 

aratgcttca tccagccara aaatagcctt cackktcccc atctgtcttc aragcaaaar 495 

agctgggacm ccaaraacaa gctgttarat cactgcctgg gaggcttggc ttartactct 555 

catctctggt tccattccag ttcagctaag tcttgcttta aaatttttac ctcctagctg 615 

ggtgcggtgg ctcacgcctg taatcccagc actttgggag gctgaggcgg gcagatcaca 675 

agatcaggag ttcgagacca gcctggccaa cccagcctgg tcaacatggt gaaaccctgt 735 

ccctactaaa gatacaaaca attagccggg cgtggtgggg tgcgcttgta atcccagcta 795 

ctcaggaggc tgaggcagga gaatcgctta aactcgggag gtagaggttg cagtgagcca 855 

aggtcacacc attgcactcc aacctgggcg acagggcgag actctgtctc aaaaaaaaaa 915 

aaa 918 



<210> 370 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 14..316

<221> sig_peptide
<222> 14..121
<223> Von Heijne matrix
score 5.19999980926514
seq PLRLLNLLILIEG/SV

<221> polyA_signal
<222> 442..447

<221> polyA_site
<222> 458..471

<400> 370

attatataga gcc atg ggg act tac aac gtg gca gtg cct tca gat gta 49
Met Gly Pro Tyr Asn Val Ala Val Pro Ser Asp Val
-35 -30 -25

tct cat gcc cgc ttt tat ttc tta ttt cat cga cca tta agg ctg tta 97
Ser His Ala Arg Phe Tyr Phe Leu Phe His Arg Pro Leu Arg Leu Leu
-20 -15 -10

aat ctg ctc atc ctt att gag ggc agt gtc gtc ttc tat cag ctc tat 145
Asn Leu Leu Ile Leu Ile Glu Gly Ser Val Val Phe Tyr Gln Leu Tyr
-5 1 5

tcc ttg ctg cgg tcg gag aag tgg aac cac aca ctt tcc atg gct ctc 193
Ser Leu Leu Arg Ser Glu Lys Trp Asn His Thr Leu Ser Met Ala Leu
10 15 20

atc ctc ttc tgc aac tac tat gtt tta ttt aaa ctt ctc cgg gac aga 241
Ile Leu Phe Cys Asn Tyr Tyr Val Leu Phe Lys Leu Leu Arg Asp Arg
25 30 35 40

wta kta tta ggc agg gca tac tcc tac cca ctc aac agt tat gaa ctc 289
Xaa Xaa Leu Gly Arg Ala Tyr Ser Tyr Pro Leu Asn Ser Tyr Glu Leu
45 50 55

aag gca aac twa gct gcc tct caw caa tgagggagaa ctcagataaa 336
Lys Ala Asn Xaa Ala Ala Ser Xaa Gln
60 65
aatattttca tacgttctat ttttttcttg tgatttttat aaatatttaa gatattttat 396 
attttgtata ctattatgtt ttgaaagtcg ggaagagtaa gggatattaa atgtatccgt 456 
aaacaaaaaa aaaaam 472 



<210> 371 

<211> 1504 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 70. .1092 

<221> sig_peptide 

<222> 70. .234 

<223> Von Heijne matrix 

score 4.09999990463257
seq AVCAALLASHPTA/EV

<221> polyA_signal
<222> 1475..1480

<221> polyA_site
<222> 1493..1504

<400> 371

agaaatcgta ggacttccga aagcagcggc ggcgtttgct tcactgcttg gaagtgtgag 60
fgcgcgaag atg cga aag gtg gtt ttr att acc ggg gct agc agt ggc att 111
Met Arg Lys Val Val Leu Ile Thr Gly Ala Ser Ser Gly Ile
-55 -50 -45

ggc ctg gcc ctc tgc aag cgg ctg ctg gcg gaa gat gat gag ctt cat 159
Gly Leu Ala Leu Cys Lys Arg Leu Leu Ala Glu Asp Asp Glu Leu His
-40 -35 -30

btg tgt ttg gcg tgc agg aat atg agc aag gca gaa gct gtc tgt gct 207
Leu Cys Leu Ala Cys Arg Asn Met Ser Lys Ala Glu Ala Val Cys Ala
-25 -20 -15 -10

gct ctg ctg gcc tct cac ccc act gct gag gtc acc att gtc cag gtg 255
Ala Leu Leu Ala Ser His Pro Thr Ala Glu Val Thr Ile Val Gln Val
-5 1 5

gat gtc agc aac ctg cag tca ttc ttc cgg gcc tcc aag gaa ctt aag 303
Asp Val Ser Asn Leu Gln Ser Phe Phe Arg Ala Ser Lys Glu Leu Lys
10 15 20

caa agg ttt cag aga tta gac tgt ata tat cta aat gct ggg atc atg 351
Gln Arg Phe Gln Arg Leu Asp Cys Ile Tyr Leu Asn Ala Gly Ile Met
25 30 35

cct aat cca caa cta aat atc aaa gca ctt ttc ttt ggc ctc ttt tca 399
Pro Asn Pro Gln Leu Asn Ile Lys Ala Leu Phe Phe Gly Leu Phe Ser
40 45 50 55

aga aaa gtg att cat atg ttc tcc aca gct gaa ggc ctg ctg acc cag 447
Arg Lys Val Ile His Met Phe Ser Thr Ala Glu Gly Leu Leu Thr Gln
60 65 70

ggt gat aag atc act gct gat gga ctt cag gag gtg ttt gag acc aat 495
Gly Asp Lys Ile Thr Ala Asp Gly Leu Gln Glu Val Phe Glu Thr Asn
75 80 85

gtc ■h -h -H t U U cau -{- 1- -1- ■3 -H l-t acc ctg acc cgg gaa ctg gag cct ctc ctc tgt 543
Jrlie r*l -fir Gly His Phe Ile Leu Ile Arg Glu Leu Glu Pro Leu Leu Cys
100

agt gac aac cca tct cag ctc atc tgg aca tca tct cgc agt gca 591
Ser Asp Asn Pro Ser Gln Leu Ile Trp Thr Ser Ser Arg Ser Ala
110 115

aaa tct aat ttc agc ctc gag gac ttc cag cac agc aaa ggc aag 639
Lys Ser Asn Phe Ser Leu Glu Asp Phe Gln His Ser Lys Gly Lys
125 130 135

gaa ccc tac agc tct tcc aaa tat gcc act gac ctt ttg agt gtg gct 687
Glu Pro Tyr Ser Ser Ser Lys Tyr Ala Thr Asp Leu Leu Ser Val Ala
140 145 150

ttg aac agg aac ttc aac cag cag ggt ctc tat tcc aat gtg gcc tgt 735
Leu Asn Arg Asn Phe Asn Gln Gln Gly Leu Tyr Ser Asn Val Ala Cys
155 160 165

cca ggt aca gca ttg acc aat ttg aca tat gga att ctg cct ccg ttt 783
Pro Gly Thr Ala Leu Thr Asn Leu Thr Tyr Gly Ile Leu Pro Pro Phe
170 175 180

ata tgg acg ctg ttg atg ccg gca ata ttg cta ctt cgc ttt ttt gca 831
Ile Trp Thr Leu Leu Met Pro Ala Ile Leu Leu Leu Arg Phe Phe Ala
185 190 195

gca ttc act ttg aca cca tat aat gga aca gaa gct ctg gta tgg 879
Ala Phe Thr Leu Thr Pro Tyr Asn Gly Thr Glu Ala Leu Val Trp
205 210 215

ttc cac caa aag cct gaa tct ctc aat cct ctg atc aaa tat ctg 927
Leu Phe His Gln Lys Pro Glu Ser Leu Asn Pro Leu Ile Lys Tyr Leu
220 225 230

gcc acc act ggc ttt gga aga aat tac att atg acc cag aag atg 975
Ser Ala Thr Thr Gly Phe Gly Arg Asn Tyr Ile Met Thr Gln Lys Met
235 240 245

cta gat gaa gac act gct gaa aaa ttt tat caa aag tta ctg gaa 1023
Leu Asp Glu Asp Thr Ala Glu Lys Phe Tyr Gln Lys Leu Leu Glu
250 255 260

gaa aag cac att agg gtc act att caa aaa aca gat aat cag gcc 1071
Leu Glu Lys His Ile Arg Val Thr Ile Gln Lys Thr Asp Asn Gln Ala
265 270 275

ctc agt ggc tca tgc cta taattccagc actttgggag gccaaggcag 1122
Arg Leu Ser Gly Ser Cys Leu
280 285

aaggatcact tgagaccagg agttcaagac cagcctgaga aacatagtga gcccttgtct 1182
ctacaaaaag aaataaaaat aatagctggg tgtggtggca tgcgcatgta gtcccagcta 1242
ctcagaagga tgaggtggga ggatctcttg aggctgggag gcagaggttg cagtgagctg 1302
agattgtgcc actgcactcc agcctgggtg acagcgagac cctgtctcaa aatatgtata 1362
tatttaatat atatataaaa ccagagctga caatgacact ctggaacatt gcataccttc 1422
tgtacattct ggggtacatg gatttctact gagttggata atatgcattt gtaataaact 1482
atgaactatg aaaaaaaaaa aa 1504

<210> 372
<211> 765
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 274..597

<221> sig_peptide
<222> 274..399
<223> Von Heijne matrix

score 5.19999980926514
seq LLFDLVCHEFCQS/DD

<221> polyA_signal
<222> 731..736

<221> polyA_site
<222> 754..765

<400> 372 

accaggaaca tccagctatt tatgatagca tttgcttcat tatgtcaagt tcaacaaatg 60 

ttgacttgct ggtgaaggtg ggggaggttg tggacaagct ctttgatttg gatgagaaac 120 

taatgttaag aatgggtcag aaatggggct gctcagcctc tggaccaacc ccaggaagag 180 

tctgaagagc agccagtgtt tcggcttgtg ccctgtatac ttgaagctgc caaacaagta 240 

cgttctgaaa atccagaatg gcttgatgtt tac atg cac att tta caa ctg ctt 294
Met His Ile Leu Gln Leu Leu
-40

act aca gtg gat gat gga att caa gca att gta cat tgt cct gac act 342
Thr Thr Val Asp Asp Gly Ile Gln Ala Ile Val His Cys Pro Asp Thr
-35 -30 -25 -20

aaa gac att tgg aat tta ctt ttt gac ctg gtc tgc cat gaa ttc 390
Gly Lys Asp Ile Trp Asn Leu Leu Phe Asp Leu Val Cys His Glu Phe
-15 -10 -5

tgc cag tct gat gat cca gcc atc att ctt caa raa car aaa acr gtg 438
Cys Gln Ser Asp Asp Pro Ala Ile Ile Leu Gln Xaa Gln Lys Thr Val
1 5 10

pta gcc tct gtt ttt tca gtg ttg tct gcc atc tat gcc tca cag act 486
11 ^'^^ Ala Ser Val Phe Ser Val Leu Ser Ala Ile Tyr Ala Ser Gln Thr
15 20 25

gag caa gak tat cta aar ata raa aaa gga gac ggt ggc tca ggg agt 534
Glu Gln Xaa Tyr Leu Lys Ile Xaa Lys Gly Asp Gly Gly Ser Gly Ser
30 35 40 45

aaa gga agg cca ktt gan caa aca gaa ktg ttc ctc tgc att tca aaa 582
Lys Gly Arg Pro Xaa Xaa Gln Thr Glu Xaa Phe Leu Cys Ile Ser Lys
50 55 60

cct tct tcc ttt cta tagccctgtg gtggaagatt ttattaaaat cctacgtgaa 637
Pro Ser Ser Phe Leu
65

gttgataagg cgcttgctga tgacttggaa aaaaacttcc caagtttgaa ggttcagact 697
taaaacctga attggaatta cttctgtaca agaaataaac tttatttttc tcactgacaa 757
aaaaaaaa 765

<210> 373
<211> 1041
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 230..469

<221> sig_peptide
<222> 230..307
<223> Von Heijne matrix

score 4.90000009536743
seq VLCTNQVLITARA/VP

<221> polyA_signal
<222> 1004..1009

<221> polyA_site
<222> 1027..1040

<400> 373 

aacttccaag ttgtagtgtt gttgttttca gcctgctgct gctgctgcta ttgcggctag 60 
gggaaccgtc gtggggaagg atggtgtgcg aaaaatgtga aaagaaactt ggtactgtta 120 
tcactccaga tacatggaaa gatggtgcta ggaataccac agaaagtggt ggaagaaagc 180 
tgaatgaaaa taaagctttg acttcaaaaa aagccagaat tgatccata atg gaa gaa 238
Met Glu Glu
-25

ata agt tct cca ctt gta gaa ttt gta aaa gtt ttg tgc acc aac cag 286
Ile Ser Ser Pro Leu Val Glu Phe Val Lys Val Leu Cys Thr Asn Gln
-20 -15 -10

gtt ctc att act gcc agg gct gtg cct aca aaa aag gca tct gtg cga 334
Val Leu Ile Thr Ala Arg Ala Val Pro Thr Lys Lys Ala Ser Val Arg
-5 1 5

'^1^9^ gtg gaa aaa agg ttt tgg ata cca aaa act aca agc aaa cat ctg 382
i'^l^ys Val Glu Lys Arg Phe Trp Ile Pro Lys Thr Thr Ser Lys His Leu
10 15 20 25

tct aga tgt att gat gga att tct ggc ttt cta aat gat ttt act ttc 430
Ser Arg Cys Ile Asp Gly Ile Ser Gly Phe Leu Asn Asp Phe Thr Phe
30 35 40

tgc ctt gaa ttt tca agg cat aga tgt caa ctt aca gaa taacatgtkt 479
Cys Leu Glu Phe Ser Arg His Arg Cys Gln Leu Thr Glu
45 50

taagataatt aagtktaaac cagaraattt gattgttact cattttgctc tcatgtccta 539
%aacagcaac agtgtaacta gtcttttgtt gtaaatggtt attttcctta taaaaatttt 599
kaaaactaag tggcaaattc catgaaaata tttctcagtt ctgtatgcac ttttatttaa 659
cattattcat ataattctcc ccccaccact ttatttataa atactgcaaa aktgaraagg 719
agataataaa tactttgctc tgaatttggc atccaaagtt aacatttctc ccctcactcc 779
cttgctggtg tcatagttat tagaatcagc agcctcttaa ctaattgcgg tttcatagga 839
tatataaatg tttcaagcca ttattgctga atggttcttt agttattaac ctagacccaa 899
atcaaagacc agttggattt atgatatttt ttatttgttc ttgcagccaa agtgccagtt 959
tctttaatat gtgaccaaga acacaaggag catccatatg gccaaataaa tacactgaat 1019
tttagaaaaa caaaaaaaaa ar 1041

<210> 374
<211> 1164
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 72..545

<221> sig_peptide
<222> 72..203
<223> Von Heijne matrix
score 5.5
seq ILFFTGWWIMIDA/AV

<221> polyA_site
<222> 1151..1162

<400> 374 

aaagtcggcg tggacgtttg aggaagctgg gatacagcat ttaatgaaaa atttatgctt 60 
aagaagtaaa a atg gca ggc ttc cta gat aat ttt cgt tgg cca gaa tgt 110
Met Ala Gly Phe Leu Asp Asn Phe Arg Trp Pro Glu Cys
-40 -35

gaa tgt att gac tgg agt gag aga aga aat gct gtg gca tct gtt gtc 158
Glu Cys Ile Asp Trp Ser Glu Arg Arg Asn Ala Val Ala Ser Val Val
-30 -25 -20

gca ggt ata ttg ttt ttt aca ggc tgg tgg ata atg att gat gca gct 206
Ala Gly Ile Leu Phe Phe Thr Gly Trp Trp Ile Met Ile Asp Ala Ala
-15 -10 -5 1

gtg gtg tat cct aag cca gaa cag ttg aac cat gcc ttt cac aca tgt 254
Val Val Tyr Pro Lys Pro Glu Gln Leu Asn His Ala Phe His Thr Cys
5 10 15

ggt gta ttt tcc aca ttg gct ttc ttc atg ata aat gct gta tcc aat 302
Gly Val Phe Ser Thr Leu Ala Phe Phe Met Ile Asn Ala Val Ser Asn
20 25 30

^^HCt cag gtg aga ggt gat agc tat gaa agc ggc tgt tta gga aga aca 350
Gln Val Arg Gly Asp Ser Tyr Glu Ser Gly Cys Leu Gly Arg Thr
35 40 45

ggt gct cga gtt tgg ctt ttc att ggt ttc atg ttg atg ttt ggg tca 398
Gly Ala Arg Val Trp Leu Phe Ile Gly Phe Met Leu Met Phe Gly Ser
55 60 65

ctt att gct tcc atg tgg att ctt ttt ggt gca tat gtt acc caa aat 446
Leu Ile Ala Ser Met Trp Ile Leu Phe Gly Ala Tyr Val Thr Gln Asn
70 75 80

act gat gtt tat ccg gga cta gct gtg ttt ttt caa aat gca ctt ata 494
Thr Asp Val Tyr Pro Gly Leu Ala Val Phe Phe Gln Asn Ala Leu Ile
85 90 95

ttt ttt agc act ctg atc tac aaa ttt gga aga acc gaa gag cta tgg 542
Phe Phe Ser Thr Leu Ile Tyr Lys Phe Gly Arg Thr Glu Glu Leu Trp
100 105 110

acc tgagatcact tcttaagtca cattttcctt ttgttatatt ctgtttgtag 595
Thr

ataggttttt tatctctcag tacacattgc caaatggagt agattgtaca ttaaatgttt 655
tgtttcttta catttttatg ttctgagttt tgaaatagtt ttatgaaatt tctttatttt 715
tcattgcata gactgttaat atgtatataa tacaagacta tatgaattgg ataatgagta 775
tcagtttttt attcctgaga tttagaactt gatctactcc ctgagccagg gttacatcat 835
cttgtcattt tagaagtaac cactcttgtc tctctggctg ggcacggtgg ctcatgcctg 895
taatcccagc actttgggag gccgaggcgg gccgattgct tgaggtcaag tgtttgagac 955
cagcctggcc aacatggcga aaccccatct actaaaaata caaaaattag ccaggcatgg 1015
tggtgggtgc ctgtaatccc aactacctag gaggctgagg caggagaatc gcttgaaccc 1075
ggggggcaga ggttgyagtg agctgagttt gcgccactgc actctagcct gggggagaaa 1135
gtgaaactcc ctctcaaaaa aaaaaaamc 1164

<210> 375
<211> 1250
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..425

<221> sig_peptide
<222> 36..119
<223> Von Heijne matrix

score 11.6000003814697
seq LLLLVQLLRFLRA/DG

<221> polyA_signal
<222> 1215..1220

<221> polyA_site
<222> 1240..1250



<400> 375 

atttcttccc cccgagctgg gcgtgcgcgg ccgca atg aac tgg gag ctg ctg 53
Met Asn Trp Glu Leu Leu
-25

ctg tgg ctg ctg gtg ctg tgc gcg ctg ctc ctg ctc ttg gtg cag ctg 101
Leu Trp Leu Leu Val Leu Cys Ala Leu Leu Leu Leu Leu Val Gln Leu
-20 -15 -10

ctg cgc ttc ctg agg gct gac ggc gac ctg acg cta cta tgg gcc gag 149
Leu Arg Phe Leu Arg Ala Asp Gly Asp Leu Thr Leu Leu Trp Ala Glu
1 5 10

tgg cag gga cga cgc cca gaa tgg gag ctg act gat atg gtg gtg tgg 197
Trp Gln Gly Arg Arg Pro Glu Trp Glu Leu Thr Asp Met Val Val Trp
15 20 25

gtg act gga gcc tcg agt gga att ggt gag gag ctg gct tac cag ttg 245
Val Thr Gly Ala Ser Ser Gly Ile Gly Glu Glu Leu Ala Tyr Gln Leu
30 35 40

tct aaa cta gga gtt tct ctt gtg ctg tca gcc aga aga gtg cat gag 293
Ser Lys Leu Gly Val Ser Leu Val Leu Ser Ala Arg Arg Val His Glu
45 50 55

ctg gaa agg gtg aaa aga aga tgc cta gag aat ggc aat tta aaa gaa 341
Leu Glu Arg Val Lys Arg Arg Cys Leu Glu Asn Gly Asn Leu Lys Glu
60 65 70

aaa gat ata ctt gtt ttg ccc ctt gac ctg acc gac act ggt tcc cat 389
Lys Asp Ile Leu Val Leu Pro Leu Asp Leu Thr Asp Thr Gly Ser His
75 80 85 90

gaa agc ggc tac caa agc tgt tct cca gga att tgg tagaatcgac 435
Glu Ser Gly Tyr Gln Ser Cys Ser Pro Gly Ile Trp
95 100

attctggtca acaatgtgga aatgtcccag cgttctctgt gcatggatac caacttggat 495
gtctacagaa agctaatgag agcttaacta cttagggacg gtgtccttga caaaatgtgk 555
kctgcctcac atgatcgaga ngaarcaagg aaagattgtt actgtgaata gcatcctggg 615
tatcatatct gtacctcttt ccattggata ctgtgctagc aagcatgctc tccggggktk 675
ktttaatggc cttcraacag aacttgccac atacccargt ataatagttt ctaacatttg 735
cccaggacct gtgcaatcaa atattgtgga aaattcccta gctggagaag tcacaaagac 795
tataggcaat aatggagacc agtcccacaa gatgacaacc agtcgttgtg tgcggctgat 855
gttaatcagc atggccaatg atttgaaaga agtttggatc tcagaacaac ctttcttgtt 915
agtaacatat ttgtggcaat acatgccaac ctgggcctgg tggataacca acaagatggg 975
gaagaaaagg attgagaact ttaagagtgg tgtggatgca gactcttctt attttaaaat 1035
ctttaagaca aaacatgact gaaaagagca cctgtacttt tcaagccact ggagggagaa 1095
atggaaaaca tgaaaacagc aatcttctta tgcttctgaa taatcaaaga ctaatttgtg 1155
attttacttt ttaatagata tgactttgct tccaacatgg aatgaaataa aaaataaata 1215
ataaaagatt gccatgaatc ttgcaaaaaa aaaaa 1250

<210> 376
<211> 947
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 155..751

<221> sig_peptide
<222> 155..340
<223> Von Heijne matrix

score 3.70000004768372
seq SILGIISVPLSIG/YC

<221> polyA_signal
<222> 912..917

<221> polyA_site
<222> 937..947

<400> 376

agtgaaaaga agatgcctag agaatggcaa tttaaaagaa aaagatatac ttgttttgcc 60 

ccttgacctg accgacactg gttcccatga agcggctacc aaagctgttc tccaggagtt 120
tggtagaatc gacattctgg tcaacaatgg tgga atg tcc cag cgt tct ctg tgc 175
Met Ser Gln Arg Ser Leu Cys
-60

agc ttg rag ata gag ctt aac tac 223
Met Asp Thr Leu Val Tyr Arg Leu Ile Glu Leu Asn Tyr
-55 -50 -45 -40

tta ggg acg gtg tcc ttg aca aaa tgt gtt ctg cct cac atg atc gag 271
Leu Gly Thr Val Ser Leu Thr Lys Cys Val Leu Pro His Met Ile Glu
-35 -30 -25

agg aag caa gga aag att gtt act gtg aat agc atc ctg ggt atc ata 319
Arg Lys Gln Gly Lys Ile Val Thr Val Asn Ser Ile Leu Gly Ile Ile
-20 -15 -10

tct gta cct ctt tcc att gga tac tgt gct agc aag cat gct ctc cgg 367
Ser Val Pro Leu Ser Ile Gly Tyr Cys Ala Ser Lys His Ala Leu Arg
-5 1 5

ggt ttt ttt aat ggc ctt cga aca gaa ctt gcc aca tac cca ggt ata 415
Gly Phe Phe Asn Gly Leu Arg Thr Glu Leu Ala Thr Tyr Pro Gly Ile
10 15 20 25

ata gtt tct aac att tgc cca gga cct gtg caa tca aat att gtg gaa 463
Ile Val Ser Asn Ile Cys Pro Gly Pro Val Gln Ser Asn Ile Val Glu
30 35 40

aat tcc cta gct gga gaa gtc aca aaa act ata ggc aat aat gga aac 511
Asn Ser Leu Ala Gly Glu Val Thr Lys Thr Ile Gly Asn Asn Gly Asn
45 50 55

cag tcc cac aag atg aca acc agt cgt tgt gtg cgg ctg atg tta atc 559
Gln Ser His Lys Met Thr Thr Ser Arg Cys Val Arg Leu Met Leu Ile
60 65 70

agc atg gcc aat gat ttg aaa gaa gtt tgg atc tca gaa caa cct ttc
Ser Met Ala Asn Asp Leu Lys Glu Val Trp Ile Ser Glu Gln Pro Phe
75 80 85

ttg tta gta aca tat ttg tgg caa tac atg cca acc tgg gcc tgg tgg
Leu Leu Val Thr Tyr Leu Trp Gln Tyr Met Pro Thr Trp Ala Trp Trp
90 95 100 105

ata acc aac aag atg ggg aag aaa agg att gag aac ttt aag agt ggt
Ile Thr Asn Lys Met Gly Lys Lys Arg Ile Glu Asn Phe Lys Ser Gly
110 115 120

gtg gat gcm rac tct tct tat ttt aaa atc ttt aag aca aaa cat gac
Val Asp Ala Xaa Ser Ser Tyr Phe Lys Ile Phe Lys Thr Lys His Asp
125 130 135







tgaaaaganc acctgtactt ttcaagccac tggagggaga aatggaaaac atgaaaacag 
caatcttctt atgcttctga ataatcaaag actaatttgt gattttactt tttaatagat 
atgactttgc ttccaacatg grrtgaaata aaaaataaat aataaaagat tgccatgrrt 
cttgcaaaaa aaaaaa 



<210> 377
<211> 621
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 46..585

<221> sig_peptide
<222> 46..120
<223> Von Heijne matrix
score 6.30000019073486
seq AFSLSVMAALTFG/CF

<221> polyA_signal
<222> 584..589

<221> polyA_site
<222> 606..619



<400> 377 

aactgggtgt gcgtrtggag tccggactcg tgggagacga tcgcg atg aac acg gtg
Met Asn Thr Val
-25

ctg tcg cgg gcg aac tca ctg ttc gcc ttc tcg ctg agc gtg atg gcs
Leu Ser Arg Ala Asn Ser Leu Phe Ala Phe Ser Leu Ser Val Met Ala
-20 -15 -10

gcg ctc acc ttc ggc tgc ttc atc ayy acc gcc ttc aaa gac agg agc
Ala Leu Thr Phe Gly Cys Phe Ile Xaa Thr Ala Phe Lys Asp Arg Ser
-5 1 5 10

gtc ccg gtg cgg ctg cac gtc tcg cga atc atg cta aaa aat gta gaa
Val Pro Val Arg Leu His Val Ser Arg Ile Met Leu Lys Asn Val Glu
15 20 25

gat ttc act gga cct aga gaa aga agt gat ctg gga ttt atc aca ttt
Asp Phe Thr Gly Pro Arg Glu Arg Ser Asp Leu Gly Phe Ile Thr Phe



30 35 40 

gat ata act gct gat cta gag aat ata ttt gat tgg aat gtt aag cag 297
Asp Ile Thr Ala Asp Leu Glu Asn Ile Phe Asp Trp Asn Val Lys Gln
45 50 55

ttg ttt ctt tat tta tca gca gaa tat tca aca aaa aat aat gct ctg 345
Leu Phe Leu Tyr Leu Ser Ala Glu Tyr Ser Thr Lys Asn Asn Ala Leu
60 65 70 75

aac caa ktt gtc cta tgg gac aag att gtt ttg aga ggt gat aat ccg 393
Asn Gln Xaa Val Leu Trp Asp Lys Ile Val Leu Arg Gly Asp Asn Pro
80 85 90

aag ctg ctg ctg aaa gat atg aaa aca aaa tat ttt ttc ttt gac gat 441
Lys Leu Leu Leu Lys Asp Met Lys Thr Lys Tyr Phe Phe Phe Asp Asp
95 100 105

gga aat ggt ctc wag gga aac agg aat gtc act ttg acc ctg tct tgg 489
Gly Asn Gly Leu Xaa Gly Asn Arg Asn Val Thr Leu Thr Leu Ser Trp
110 115 120

aac gtc gta cca aat gct gga att cta cct ctt gtg aca gga tca gga 537
Asn Val Val Pro Asn Ala Gly Ile Leu Pro Leu Val Thr Gly Ser Gly
125 130 135

cac gta tct gtc cca ttt cca gat aca tat gaa ata acg aag agt tat 585
His Val Ser Val Pro Phe Pro Asp Thr Tyr Glu Ile Thr Lys Ser Tyr
140 145 150 155

caaattattc tgaatttgaa acaaaaaaaa aaaahm 621



<210> 378
<211> 52
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -20..-1

<400> 378

Met Pro Ser Val Asn Ser Ala Gly Leu Cys Val Leu Gln Leu Thr Thr
-20 -15 -10 -5

Ala Val Thr Ser Ala Phe Leu Leu Ala Lys Val Asn Pro Phe Glu Xaa
1 5 10

Phe Leu Ser Arg Gly Phe Trp Leu Cys Ala Ala His His Phe Ile His
15 20 25

Pro Cys Leu Asp
30



<210> 379 
<211> 193 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23 . . -1 



<400> 379 






MpI- VaT 


T .01 1 


Arg Ala Gly 


Lys 




O A 

- zO 








AX a. irne 


AX a 


Cys Arg Gly 


Cys 


-5 








1 


\jXu. riiy 


Asp 


Thr Ala 


Pro 


Ser 






15 






Airg Xtys oer 


Cys 


His Asp 


Trp 


He 






30 






T.^n 2iy"rT Dv*/^ 

UC^U. f\^y irJLV^ 


Val 


His Phe 


Tyr 


xxe 




45 








vjrj.ll ijyS J-iSU 


Arg 


Lys Leu 


Arg 


Gin 


O 0 








65 


rTLe lip AX a 


Asn 


Gin Asn 


Leu 


Thr 


75 






80 




xxe rixs ber 


Arg Leu Lys 


Thr 


Lys 


y 0 




95 






vjxy Lrxn Xiys 


Ala 


Thr Leu 


Asn 


Ala 






110 






Glu Phe Leu 


Ser 


Lys Asn 


Phe 


Gin 




125 








Trp Tyr Lys 


Arg 


Asn Phe 


Ala 


He 


14 0 








145 


jteu Glu Arg 


He 


Trp Asn Lys 


Leu 


It 155 






160 





Asn
170



<210> 380
<211> 82
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -14..-1



<400> 380 












Met Ala 


Phe 


Thr 


Leu 
-10 


Xaa 


Ser 


Leu 


Asn Ala 


He 
5 


Ala 


Val 


Leu 


His 


Glu 
10 


Trp Gly 


Thr 


Asp Gin Gly He 


Gly 


20 










25 




Lys Ser 


Xaa 


Xaa 


Met 


Xaa 


Leu 


He 


35 








40 






Val Pro 


Leu 


He 


He 
55 


Val 


Asn 


Ser 


Phe Gly 















Lys 


Thr 


Phe 


Leu 


Pro 


Pro 


Leu 


Xaa 


-15 










-10 






Gin 


Leu 


Ala 


Pro 

c 


Glu 


Arg Gly Ala 


Gly Val 


Ser Arg 


Phe 


Cys 


Pro 


Pro 






20 










25 


Gly Pro Pro Asp 


Lys 


Tyr 


Ser 


Asn 




35 










40 




Pro 


Glu 


Asn 


Glu 


Ser 


Pro 


Leu 


Glu 


50 










55 






Glu 


Thr 


Gin 


Glu 


Trp 


Asn 


Gin 


Gin 










70 








Phe 


Ser 


Lys 


Glu 


Lys 


Glu 


Glu 


Phe 








85 










Gly Leu Gly Leu 


Arg 


Thr 


Glu 


Ser 






100 










105 


Glu 


Glu 


Met 


Ala 


Asp 


Phe 


Tyr 


Lys 




115 










120 




Lys 


His 


Met 


Tyr 


Tyr 


Asn Arg Asp 


130 










135 






Thr 


Phe 


Phe 


Met 


Gly 


Lys 


Val 


Ala 










150 








Lys 


Gin 


Lys 


Gin 


Lys 


Lys 


Arg 


Ser 



165 



Leu Gin Ala Ala Leu Leu Cys Val 

-5 1 
Glu Arg Phe Leu Lys Asn He Gly 
15 

Gly Phe Gly Glu Glu Pro Gly He 
30 

Arg Ser Val Arg Thr Val Met Arg 

45 50 
He Ala He Val Leu Leu Leu Leu 
60 65 



<210> 381 
<211> 198 
<212> PRT 
<213> Homo sapiens 






<220> 

<221> SIGNAL 
<222> -21. .-1 



<400> 381 

Met Pro Val Pro Ala Leu Cys Leu Leu Trp Ala Leu Ala Met Val Thr
-20 -15 -10

Arg Pro Ala Ser Ala Ala Pro Met Gly Gly Pro Glu Leu Ala Gln His
-5 1 5 10

Glu Glu Leu Thr Leu Leu Phe His Gly Thr Leu Gln Leu Gly Gln Ala
15 20 25

Leu Asn Gly Val Tyr Arg Thr Thr Glu Gly Arg Leu Thr Lys Ala Arg
30 35 40

Asn Ser Leu Gly Leu Tyr Gly Arg Thr Ile Glu Leu Leu Gly Gln Glu
45 50 55

Val Ser Arg Gly Arg Asp Ala Ala Gln Glu Leu Arg Ala Ser Leu Leu
60 65 70 75

Glu Thr Gln Met Glu Glu Asp Ile Leu Xaa Leu Gln Ala Xaa Ala Thr
80 85 90

Ala Glu Val Leu Gly Glu Val Ala Gln Ala Gln Lys Val Leu Arg Asp
95 100 105

Ser Val Gln Arg Leu Xaa Xaa Gln Leu Xaa Xaa Ala Trp Leu Gly Pro
110 115 120

Ala Tyr Arg Lys Phe Glu Val Leu Lys Ala Pro Pro Xaa Lys Gln Asn
125 130 135

His Ile Leu Trp Ala Leu Thr Gly His Val Xaa Arg Gln Xaa Arg Glu
140 145 150 155

Met Val Ala Gln Gln Xaa Xaa Leu Xaa Gln Ile Gln Glu Lys Leu His
160 165 170

Thr Ala Ala Leu Pro Ala
175
170 



<210> 382

<211> 160

<212> PRT

<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -55 . . -1 



<400> 382 












Met Asp Lys Leu Lys Lys Val Leu Ser Gly Gln Asp Thr Glu Asp Arg
-55 -50 -45 -40

Ser Gly Leu Ser Glu Val Val Glu Ala Ser Ser Leu Ser Trp Ser Thr
-35 -30 -25

Arg Ile Lys Gly Phe Ile Ala Cys Phe Ala Ile Gly Ile Leu Cys Ser
-20 -15 -10

Leu Leu Gly Thr Val Leu Leu Trp Val Pro Arg Lys Gly Leu His Leu
-5 1 5

Phe Ala Val Phe Tyr Thr Phe Gly Asn Ile Ala Ser Ile Gly Ser Thr
10 15 20 25

Ile Phe Leu Met Gly Pro Val Lys Gln Leu Lys Arg Met Phe Glu Pro
30 35 40






Thr Arg Leu Ile Ala Thr Ile Met Val Leu Leu Cys Phe Ala Leu Thr
45 50 55

Leu Cys Ser Ala Phe Trp Trp His Asn Lys Gly Leu Ala Leu Ile Phe
60 65 70

Cys Ile Leu Gln Ser Leu Ala Leu Thr Trp Tyr Ser Leu Ser Phe Ile
75 80 85

Pro Phe Ala Arg Asp Ala Val Lys Xaa Cys Phe Ala Val Cys Leu Ala
90 95 100 105



<210> 383 

<211> 108 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -18. .-1 

<400> 383 

Met Lys Ala Leu Cys Leu Leu Leu Leu Pro Val Leu Gly Leu Leu Val
-15 -10 -5

Ser Ser Lys Thr Leu Cys Ser Met Glu Glu Ala Ile Asn Glu Arg Ile
1 5 10

Gln Glu Val Ala Gly Ser Leu Ile Phe Arg Ala Ile Ser Ser Ile Gly
15 20 25 30

Arg Gly Ser Glu Ser Val Thr Ser Arg Gly Asp Leu Ala Thr Cys Pro
35 40 45

Arg Gly Phe Ala Val Thr Gly Cys Thr Cys Gly Ser Ala Cys Gly Ser
50 55 60

Trp Asp Val Arg Ala Glu Thr Thr Cys His Cys Gln Cys Ala Gly Met
65 70 75

Asp Trp Thr Gly Ala Arg Cys Cys Arg Val Gln Pro
80 85 90



<210> 384 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -22. .-1 

<400> 384 

Met Ile Ser Arg Gln Leu Arg Ser Leu Ser Cys Leu Cys Pro Ala Leu
-20 -15 -10

Phe Pro Gly Thr Ser Ser Phe Ile Val Ala Leu Ser Ser Pro Ala Asp
-5 1 5 10

Leu Tyr Ile Pro Xaa Arg Xaa Arg Ser Asp Glu Leu Val Phe Glu Ser
15 20 25

Gln Lys Gly Ser Ala Met Glu Leu Ala Val Ile Thr Val Xaa Gly Val
30 35 40






<210> 385 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15. .-1 

<400> 385 

Met Gly Phe Leu Xaa Leu Met Thr Leu Thr Thr His Val His Ser Ser
-15 -10 -5 1

Ala Lys Pro Asn Glu Gln Pro Trp Leu Leu Asn
5 10



<210> 386
<211> 186
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -21..-1

<400> 386

Met Ser Pro Ser Gly Arg Leu Cys Leu Leu Thr Ile Val Gly Leu Ile
-20 -15 -10

Leu Pro Thr Arg Gly Gln Thr Leu Lys Asp Thr Thr Ser Ser Ser Ser
-5 1 5 10

Ala Asp Ser Thr Ile Met Asp Ile Gln Val Pro Thr Arg Ala Pro Asp
15 20 25

Ala Val Tyr Thr Glu Leu Gln Pro Thr Ser Pro Thr Pro Thr Trp Pro
30 35 40

Ala Asp Glu Thr Pro Gln Pro Gln Thr Gln Thr Gln Gln Leu Glu Gly
45 50 55

Thr Asp Gly Pro Leu Val Thr Asp Pro Glu Thr His Xaa Ser Xaa Lys
60 65 70 75

Ala Ala His Pro Thr Asp Asp Thr Thr Thr Leu Ser Glu Arg Pro Ser
80 85 90

Pro Ser Thr Xaa Val His Xaa Arg Pro Xaa Xaa Pro Ser Xaa His Leu
95 100 105

Val Phe Met Arg Met Thr Pro Ser Ser Met Met Asn Thr Pro Ser Gly
110 115 120

Asn Xaa Gly Cys Trp Ser Gln Leu Cys Cys Ser Ser Gln Ala Ser Ser
125 130 135

Ser Ser Pro Val Ala Ser Ala Gly Ser Cys Pro Gly Tyr Ala Gly Ile
140 145 150 155

Ile Ala Gly Glu Ser Ile Arg Asn Arg Ser
160 165



<210> 387 






<211> 179 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26..-1



<400> 387 



Met Glu Thr Gly Ala Leu Arg Arg Pro Gln Leu Leu Pro Leu Leu Leu
-25 -20 -15

Leu Leu Cys Gly Pro Ser Gln Asp Gln Cys Arg Pro Val Leu Gln Asn
-10 -5 1 5

Leu Leu Gln Ser Pro Gly Leu Thr Trp Ser Leu Glu Val Pro Thr Gly
10 15 20

Arg Glu Gly Lys Glu Gly Gly Asp Arg Gly Pro Gly Leu Xaa Gly Ala
25 30 35

Thr Pro Ala Arg Ser Pro Gln Gly Lys Glu Met Gly Arg Gln Arg Thr
40 45 50

Arg Lys Val Lys Gly Pro Ala Trp Xaa His Thr Ala Asn Gln Glu Leu
55 60 65 70

Asn Arg Met Arg Ser Leu Ser Ser Gly Ser Val Pro Val Gly His Leu
75 80 85

Glu Gly Gly Thr Val Lys Leu Gln Lys Asp Thr Gly Leu His Ser Cys
90 95 100

Xaa Asp Gly Met Ala Ser Leu Glu Gly Thr Pro Ala Ser Val Leu Ala
105 110 115

Asp Ala Cys Pro Gly Phe His Asp Val Xaa Val Gln Xaa Ala Leu Phe
120 125 130

Gly Leu Ser Gly Xaa Xaa Leu Trp Leu Lys Thr His Phe Cys Leu Ser
135 140 145 150

Ile Xaa Leu



<210> 388
<211> 150
<212> PRT
<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -55. .-1 



<400> 388 












Met Ala Thr Thr Val Pro Asp Gly Cys Arg Asn Gly Leu Lys Ser Lys
-55 -50 -45 -40

Tyr Tyr Arg Leu Cys Asp Lys Ala Glu Ala Trp Gly Ile Val Leu Glu
-35 -30 -25

Thr Val Ala Thr Ala Gly Val Val Thr Ser Val Ala Phe Met Leu Thr
-20 -15 -10

Leu Pro Ile Leu Val Cys Lys Val Gln Asp Ser Asn Arg Arg Lys Met
-5 1 5

Leu Pro Thr Gln Phe Leu Phe Leu Leu Gly Val Leu Gly Ile Phe Gly
10 15 20 25

Leu Thr Phe Ala Phe Ile Ile Gly Leu Asp Gly Ser Thr Gly Pro Thr
30 35 40






Arg Phe Phe Leu Phe Gly Ile Leu Phe Ser Ile Cys Phe Ser Cys Leu
45 50 55

Leu Ala His Ala Val Ser Leu Thr Lys Leu Val Arg Gly Arg Lys Ala
60 65 70

Pro Phe Pro Val Gly Asp Ser Gly Ser Gly Arg Gly Leu Gln Pro Ser
75 80 85

Pro Gly Cys Tyr Arg Tyr
90 95



<210> 389 
<211> 236 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -31. .-1 



<400> 389 



Met Leu Ser Lys Gly Leu Lys Arg Lys Arg Glu Glu Glu Glu Glu Lys
-30 -25 -20

Glu Pro Leu Ala Val Asp Ser Trp Trp Leu Asp Pro Gly His Ala Ala
-15 -10 -5 1

Val Ala Gln Ala Pro Pro Ala Val Ala Ser Ser Ser Leu Phe Asp Leu
5 10 15

Ser Val Leu Lys Leu His His Ser Leu Gln Xaa Ser Xaa Pro Asp Leu
20 25 30

Arg His Leu Val Leu Val Xaa Asn Thr Leu Arg Arg Ile Gln Ala Ser
35 40 45

imt Ala Pro Ala Ala Ala Leu Pro Pro Val Pro Thr Pro Pro Ala Ala
50 55 60 65

Pro Xaa Val Ala Asp Asn Leu Leu Ala Ser Ser Asp Ala Ala Leu Ser
70 75 80

Ala Ser Met Ala Xaa Leu Leu Glu Asp Leu Ser His Ile Glu Gly Leu
85 90 95

Ser Gln Ala Pro Gln Pro Leu Ala Asp Glu Gly Pro Pro Gly Arg Ser
100 105 110

Ile Gly Gly Xaa Pro Pro Xaa Leu Gly Ala Leu Asp Leu Leu Gly Pro
115 120 125

Ala Thr Gly Cys Leu Leu Asp Asn Gly Leu Glu Gly Leu Phe Glu Asp
130 135 140 145

Ile Asp Thr Ser Met Tyr Asp Asn Glu Leu Trp Ala Pro Ala Ser Glu
150 155 160

Gly Leu Lys Pro Gly Pro Glu Asp Gly Pro Gly Lys Glu Glu Ala Pro
165 170 175

Glu Leu Asp Glu Ala Glu Leu Asp Tyr Leu Met Asp Val Leu Val Gly
180 185 190

Thr Gln Ala Leu Glu Arg Pro Pro Gly Pro Gly Arg
195 200 205







<210> 390 
<211> 149 
<212> PRT 






<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -100 . . -1 



<400> 390 



Met Glu Thr Leu Tyr Arg Val Pro Phe Leu Val Leu Glu Cys Pro Asn
-100 -95 -90 -85

Leu Lys Leu Lys Lys Pro Pro Trp Leu His Met Pro Ser Ala Met Thr
-80 -75 -70

Val Tyr Ala Leu Val Val Val Ser Tyr Phe Leu Ile Thr Gly Gly Ile
-65 -60 -55

Ile Tyr Asp Val Ile Val Glu Pro Pro Ser Val Gly Ser Met Thr Asp
-50 -45 -40

Glu His Gly His Gln Arg Pro Val Ala Phe Leu Ala Tyr Arg Val Asn
-35 -30 -25

Gly Gln Tyr Ile Met Glu Gly Leu Ala Ser Ser Phe Leu Phe Thr Met
-20 -15 -10 -5

Gly Gly Leu Gly Phe Ile Ile Leu Asp Gly Ser Asn Ala Pro Asn Ile
1 5 10

Pro Lys Leu Asn Arg Phe Leu Leu Leu Phe Ile Gly Phe Val Cys Val
15 20 25

Leu Xaa Ser Phe Xaa Xaa Ala Arg Val Phe Met Arg Met Lys Leu Pro
30 35 40

Gly Tyr Leu Met Gly
45



















<210> 391
<211> 69
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -49..-1

<400> 391

Met Pro Phe His Phe Pro Phe Leu Gly Phe Val Cys Leu His Leu His
-45 -40 -35

Leu Thr Pro Cys Leu Thr Val Pro Arg Arg Pro Leu Phe Leu Leu Leu
-30 -25 -20

His Leu Cys Pro His Leu Pro Phe Leu Leu Leu Leu Ser Cys Val Gly
-15 -10 -5

Xaa Xaa Pro Ser Cys Leu Pro Ser Ser Ser Thr Cys Val Ser Leu His
1 5 10 15

Phe Phe Ile Pro Asp
20



<210> 392 
<211> 241 
<212> PRT 

<213> Homo sapiens 






<220> 

<221> SIGNAL 
<222> -30. . -1 



<400> 392 





Gly Thr Ala 


Ser Arg 


Ser Asn He Ala 


Arg 


His 


Leu 


Gin 


Thr 


Asn 










-25 




-20 










-15 


Leu 


He 


Leu 


Phe 


Cys Val 


Gly Ala Val Gly Ala 


Cys 


Thr 


Leu 


Ser 


Val 










-10 


-5 










1 




Thr 


Gin 


Pro 


Trp 


Tyr Leu 


Glu Val Asp Tyr 


Thr 


His 


Glu 


Ala 


Val 


Thr 






5 






10 






15 








lie 


Lys 


Cys 


Thr 


Phe Ser 


Ala Thr Gly Cys 


Pro 


Ser 


Glu 


Gin 


Pro 


Thr 




20 








25 




30 










Cys 


Leu 


Trp 


Phe 


Arg Tyr Gly Ala His Gin 


Pro 


Glu 


Asn 


Leu 


Cys 


Leu 


35 








40 




45 








50 


Asp 


Gly Cys 


Lys 


Ser Glu 


Ala Xaa Lys Phe 


Thr 


Val 


Arg 


Glu 


Ala 


Leu 










55 


60 










65 




Lys 


Glu 


Asn 


Gin 


Val Ser 


Leu Thr Val Asn Arg 


Val 


Thr 


Ser 


Asn 


Asp 








70 




75 








80 




Ser 


Ala 


He 


Tyr 


He Cys 


Gly He Ala Phe 


Pro 


Ser 


Val 


Pro 


Glu 


Ala 






85 






90 






95 








^■^^ 


Ala 


Lys 


Gin 


Thr Gly Gly Gly Thr Thr Leu 


Val 


Val 


Arg 


Glu 


He 




100 








105 




110 








^^iys 


Leu 


Leu 


Ser 


Lys Glu 


Leu Arg Ser Phe 


Leu 


Thr 


Ala 


Leu 


Val 


Ser 










120 




125 










130 


yLeu 


Leu 


Ser 


Val 


Tyr Val 


Thr Gly Val Cys 


Val 


Ala 


Phe 


He 


Leu 


Leu 










135 


140 










145 




yj$er 


Lys 


Ser 


Lys 


Ser Asn 


Pro Leu Arg Asn 


Lys 


Glu 


He 


Lys 


Glu 


Asp 








150 




155 








160 






ii: Ser 


Gin 


Lys 


Lys 


Lys Ser 


Ala Arg Arg He 


Phe 


Gin 


Glu 


He 


Ala 


Gin 






165 






170 






175 










Leu 


Tyr 


His 


Lys Arg His Val Glu Thr 


Asn 


Gin 


Gin 


Ser 


Glu 


Lys 




180 








185 




190 








■ Asp 


Asn 


Asn 


Thr 


Tyr Glu 


Asn Arg Arg Val 


Leu 


Ser 


Asn 


Tyr 


Glu 


Arg 


''495 








200 




205 








210 



Pro



<210> 393 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30. . -1 

<400> 393 

Met Asn Cys Asn Val Val Ser Glu Arg Gly Lys Trp Leu Glu Val Glu
-30 -25 -20 -15

Cys Ser Leu Met Thr Cys Thr Thr Leu Ile Asn Ala Ser Ala Ile Ser
-10 -5 1

Thr Asn Thr Leu Thr Asp Met Gly Ser Phe Asp Arg Arg Glu Ser
5 10 15






<210> 394 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28. .-1 

<400> 394 

Met Ala Phe Gly Leu Gln Met Phe Ile Gln Arg Lys Phe Pro Tyr Pro
-25 -20 -15

Leu Gln Trp Ser Leu Leu Val Ala Val Val Ala Gly Ser Val Val Ser
-10 -5 1

Tyr Gly Val Thr Arg Val Glu Ser Glu Lys Cys Asn Asn Leu Trp Leu
5 10 15 20

Phe Leu Glu Thr Gly Gln Leu Pro Lys Asp Arg Ser Thr Asp Gln Xaa
25 30 35

Ser



<210> 395
<211> 73
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -24..-1

<400> 395

Met Thr Cys Trp Met Leu Pro Pro Ile Ser Phe Leu Ser Tyr Leu Pro
-20 -15 -10

Leu Trp Leu Gly Pro Ile Trp Pro Cys Ser Gly Ser Thr Leu Gly Lys
-5 1 5

Pro Asp Pro Gly Val Trp Pro Ser Leu Phe Arg Pro Trp Asp Ala Ala
10 15 20

Ser Pro Gly Asn Tyr Ala Leu Ser Arg Gly Xaa Asn Xaa Tyr Xaa Xaa
25 30 35 40

Trp Gly Gln Gly Thr His Ser Ser Leu
45



<210> 396 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -18 . . -1 



<400> 396 






Met Pro Cys Pro Thr Trp Thr Cys Leu Lys Ser Phe Pro Ser Pro Thr
-15 -10 -5

Ser Ser His Ala Ser Ser Leu His Leu Pro Pro Ser Cys Thr Arg Leu
1 5 10

Thr Leu Thr Gln Thr Leu Arg Thr Gly Met His Leu Ser Arg Ala Leu
15 20 25 30

Gln Gly Thr Leu Thr Arg Leu Gln Ser Thr Pro Ala
35 40



<210> 397 
<211> 192 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -93. .-1 



<400> 397 
Met Ala Glu Leu Gly Leu Asn Glu His His Gln Asn Glu Val Ile Asn
-90 -85 -80

Tyr Met Arg Phe Ala Arg Ser Lys Arg Gly Leu Arg Leu Lys Thr Val
-75 -70 -65

Asp Ser Cys Phe Gln Asp Leu Lys Glu Ser Arg Leu Val Glu Asp Thr
-60 -55 -50

Phe Thr Ile Asp Glu Val Ser Glu Val Leu Asn Gly Leu Gln Ala Val
-45 -40 -35 -30

Val His Ser Glu Val Glu Ser Glu Leu Ile Asn Thr Ala Tyr Thr Asn
-25 -20 -15

Val Leu Leu Leu Arg Gln Leu Phe Ala Gln Ala Glu Lys Trp Tyr Leu
-10 -5 1

^^|jys Leu Gln Thr Asp Ile Ser Glu Leu Glu Asn Arg Glu Leu Leu Glu
5 10 15

Gln Xaa Ala Glu Phe Glu Lys Ala Xaa Ile Thr Ser Ser Asn Lys Lys
20 25 30 35

Pro Ile Leu Xaa Val Thr Xaa Pro Lys Leu Ala Pro Leu Asn Glu Gly
40 45 50

Gly Thr Ala Lys Leu Leu Asn Lys Val Ile Cys Ile Ile Leu Arg Asn
55 60 65

Gly Lys Ser Leu Ile Leu Ser Cys His Cys Leu Gly Trp Arg Asn Lys
70 75 80

Ser Gly Arg Phe Val Ser Gly Pro Leu Arg Ile Ile Ser Pro Leu Gln
85 90 95



<210> 398 
<211> 149 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -72. . -1 






<400> 398 

Met Asn Leu Phe Ile Met Tyr Met Ala Gly Asn Thr Ile Ser Ile Phe
-70 -65 -60

Pro Thr Met Met Val Cys Met Met Ala Trp Arg Pro Ile Gln Ala Leu
-55 -50 -45

Met Ala Ile Ser Ala Thr Phe Lys Met Leu Glu Ser Ser Ser Gln Lys
-40 -35 -30 -25

Phe Leu Gln Gly Leu Val Tyr Leu Ile Gly Asn Leu Met Gly Leu Ala
-20 -15 -10

Leu Ala Val Tyr Lys Cys Gln Ser Met Gly Leu Leu Pro Thr His Ala
-5 1 5

Ser Asp Trp Leu Ala Phe Ile Glu Pro Pro Glu Arg Met Glu Ser Val
10 15 20

Val Glu Asp Cys Phe Cys Glu His Glu Lys Ala Ala Pro Gly Pro Tyr
25 30 35 40

Val Phe Gly Ser Tyr Leu His Pro Ser Leu Ser Pro Val Ala Pro Gln
45 50 55

His Thr Leu Lys Leu Ile Thr Tyr Val Lys Lys Asn Gln Lys Thr Leu
60 65 70

Phe Ser Met Val Gly
75



<210> 399
<211> 73
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -20..-1

<400> 399

Met Thr Pro Leu Leu Thr Leu Ile Leu Val Val Leu Met Gly Leu Pro
-20 -15 -10 -5

Leu Ala Gln Ala Leu Asp Cys His Val Cys Ala Tyr Asn Gly Asp Asn
1 5 10

Cys Phe Asn Pro Met Arg Cys Pro Ala Met Val Ala Tyr Cys Met Thr
15 20 25

Thr Arg Thr Tyr Tyr Thr Pro Thr Arg Met Lys Val Ser Lys Ser Cys
30 35 40

Val Pro Arg Cys Phe Glu Xaa Cys Val
45 50



<210> 400 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -20 . . -1 



<400> 400 








Asn 


Leu. 


JllS 


File 


Pro 


Gin Trp 


Phe 


Val Hxs Ser Ser 


Ala 


Leu 


Gly 












-15 






-10 






-5 




VSi± 


Leu 


Aia. 


Pro 


Pro 


Phe Ser 


Ser 


Pro Gly Thr Asp 


Pro 


Thr 


Phe 
















5 




10 






Pdto 


Cys 


X-Le 


Tyr 


Cys 


Arg 


Leu Leu 


Asn 


Met He Met Thr 


Arg 


Leu 


Ala 






15 








20 




25 








Phe 


Ser 


Phe 


He 


Thr 


Cys 


Leu Cys 


Pro 


Asn Leu Lys Glu 


Val 


Cys 


Leu 




30 










35 




40 






He 


Leu 


Pro 


Glu 


Lys 


Asn 


Cys Asn Ser Arg His Ala Gly Phe Val Gly 


45 










50 






55 






60 


Pro 


Xaa 


Lys 


Leu 


Arg 


Gin 















65 



<210> 401 
<211> 78 
<212> PRT 

<213> Homo sapiens 

<220> 
<221> SIGNAL
<222> -21..-1

<400> 401

Met Cys Pro Val Phe Ser Lys Gln Leu Leu Ala Cys Gly Ser Leu Leu
-20 -15 -10

Pro Gly Leu Trp Gln His Leu Thr Ala Asn His Trp Pro Pro Phe Ser
-5 1 5 10

Xaa Phe Leu Cys Thr Val Cys Ser Gly Ser Ser Glu Gln Ile Ser Glu
15 20 25

ijyx: Thr Ala Ser Ala Thr Pro Pro Leu Cys Arg Ser Leu Asn Gln Glu
30 35 40

Pro Phe Val Ser Arg Ala Ile Arg Pro Lys Tyr Ser Ile Thr
45 50 55

<210> 402 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28. . -1 

<400> 402 

Met Gly Lys Gly His Gln Arg Pro Trp Trp Lys Val Leu Pro Leu Ser
-25 -20 -15

Cys Phe Leu Val Ala Leu Ile Ile Trp Cys Tyr Leu Arg Glu Glu Ser
-10 -5 1

Glu Ala Asp Gln Trp Leu Arg Gln Val Trp Gly Glu Val Pro Glu Pro
5 10 15 20

Ser Asp Arg Ser Glu Glu Pro Glu Thr Pro Ala Ala Tyr Arg Ala Arg
25 30 35

Thr 






<210> 403 
<211> 211 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -27. .-1 



<400> 403 



Met Leu Leu Leu Ser Ile Thr Thr Ala Tyr Thr Gly Leu Glu Leu Thr
-25 -20 -15

Phe Phe Ser Gly Val Tyr Gly Thr Cys Ile Gly Ala Thr Asn Lys Phe
-10 -5 1 5

Gly Ala Glu Glu Xaa Ser Leu Ile Gly Leu Ser Gly Ile Phe Ile Gly
10 15 20

Ile Gly Glu Ile Leu Gly Gly Ser Leu Phe Gly Leu Leu Ser Lys Asn
25 30 35

Asn Arg Phe Gly Arg Asn Pro Val Val Leu Leu Gly Ile Leu Val His
40 45 50

Phe Ile Ala Phe Tyr Leu Ile Phe Leu Asn Met Pro Gly Asp Ala Pro
55 60 65

Ile Ala Pro Val Lys Gly Thr Asp Ser Ser Ala Tyr Ile Lys Ser Ser
70 75 80 85

^lijys Xaa Phe Ala Ile Leu Cys Xaa Phe Leu Xaa Gly Leu Gly Asn Ser
90 95 100

Ijfys Phe Asn Thr Xaa Leu Leu Xaa Ile Xaa Gly Phe Leu Tyr Ser Glu
105 110 115

Xaa Ser Ala Pro Xaa Phe Ala Ile Phe Asn Phe Val Gln Ser Ile Cys
120 125 130

Ala Ala Val Ala Phe Phe Tyr Ser Asn Tyr Leu Leu Leu His Trp Gln
135 140 145

Leu Leu Val Met Val Ile Phe Gly Phe Xaa Gly Thr Ile Ser Phe Phe
150 155 160 165

Thr Val Glu Trp Glu Xaa Ala Ala Phe Val Xaa Arg Gly Ser Asp Tyr
170 175 180

Arg Ser Ile



<210> 404 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -80 . . -1 

<400> 404 

Met Ser Thr Trp Tyr Leu Ala Leu Asn Lys Ser Tyr Lys Asn Lys Asp 
-80 -75 -70 -65 

Ser Val Arg Ile Tyr Leu Ser Leu Cys Thr Val Ser Ile Lys Phe Thr
-60 -55 -50 






Tyr Jrlie xilS Asp Ile Gln Thr Asn Cys Leu Thr Thr Trp Lys His Ser
-45 -40 -35

Arg Cys Arg Phe Tyr Trp Ala Phe Gly Gly Ser Ile Leu Gln His Ser
-30 -25 -20

Val Asp Pro Leu Val Leu Phe Leu Ser Leu Ala Leu Leu Val Thr Pro
-15 -10 -5

Thr Ser Thr Pro Ser Ala Lys Ile Gln Ser Leu Gln Ile Asp Leu Pro
1 5 10 15

Gly Gly Trp Arg Leu Ala Thr Asp Arg Ile Phe Thr Leu Ser Pro Val
20 25 30

Pro Met Asp Xaa Pro Leu Ile Leu His Gln Leu
35 40



<210> 405
<211> 86
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -26..-1

<400> 405

Met Glu Lys Ser Trp Met Leu Trp Asn Phe Val Glu Arg Trp Leu Ile
-25 -20 -15

Ala Leu Ala Ser Trp Ser Trp Ala Leu Cys Arg Ile Ser Leu Leu Pro
-10 -5 1 5

Leu Ile Val Thr Phe His Leu Tyr Gly Gly Ile Ile Leu Leu Leu Leu
10 15 20

Ile Phe Ile Ser Ile Xaa Gly Ile Leu Tyr Lys Phe Xaa Asp Val Leu
25 30 35

Leu Tyr Phe Pro Xaa Gln Xaa Ser Ser Ser Arg Leu Tyr Asp Ser His
40 45 50

Ala His Trp Xaa Ser Xaa
55 60



<210> 406 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -31. . -1 

<400> 406 

Met Ala Ala Ala Trp Pro Ser Gly Pro Xaa Ala Pro Glu Ala Val Thr
-30 -25 -20

Ala Arg Leu Val Gly Val Leu Trp Phe Val Ser Val Thr Thr Gly Pro
-15 -10 -5 1

Trp Gly Ala Val Ala Thr Ser Ala Gly Gly Glu Glu Ser Leu Lys Cys
5 10 15

Glu Asp Leu Lys Val Gly Gln Tyr Ile Cys Lys Asp Pro Lys Ile Asn










20 25 30








Asp 


Ala 


Thr 


Gin 


Glu 


Pro Val Asn Cys Thr Asn 


Tyr 


Thr 


Ala 


His 


Val 




35 








40 


45 












Cys 


Pile 


Pro 


Ala 


Pro Asn lie Thr Cys Lys 


Asp 


Ser 


Ser 


Gly Asn 












55 60 










65 




Tnir 


HXS 


Fne 


Thr Gly Asn Glu Val Gly Phe Phe 


Lys 


Pro 


He 


Ser 










70 


75 








80 




Cys 


Arg 


Asn 


Val 


Asn 


Gly Tyr Ser Tyr Asn Glu 


Gin 


Ser 


His 


Val 


Ser 








85 




90 






95 






Phe 


Ser 


Trp 


Met 


Val 


Gly Ser Arg Ser lie Leu 


Pro 


Trp 


He 


Pro 


Cys 






100 






105 




110 






Phe 


Gly 


Phe 


Val 


Lys 


Xaa Xaa His Cys Arg Val 


Xaa 


Trp Asn 


Trp 


Glu 




115 








120 


125 








Pro 


Asn 




















130 























<210> 407 
<211> 98 
<212> PRT 
<213> Homo sapiens

<221> SIGNAL
<222> -37..-1

<400> 407

Met Ala Ser Leu Leu Cys Cys Gly Pro Lys Leu Ala Ala Cys Gly Ile
-35 -30 -25

Val Leu Ser Ala Trp Gly Val Ile Met Leu Ile Met Leu Gly Ile Phe
-20 -15 -10

n?^^ Asn Val His Ser Ala Val Leu Ile Glu Asp Val Pro Phe Thr Glu
-5 1 5 10

J Jljys Asp Phe Glu Asn Gly Pro Gln Asn Ile Tyr Asn Leu Tyr Xaa Gln
15 20 25

Xaa Ser Tyr Asn Cys Phe Ile Ala Ala Gly Leu Tyr Leu Leu Leu Gly
30 35 40

Gly Phe Ser Phe Cys Gln Xaa Arg Leu Asn Lys Arg Lys Glu Tyr Met
45 50 55

Val Arg
60



<210> 408 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 

<400> 408 

Met Arg Phe Leu Pro Cys Cys Leu Leu Trp Ser Val Phe Asn Pro Glu 
-15 -10 -5 1 






Ser Leu Asn Cys His Tyr Phe Xaa Xaa Glu Xaa Cys Ile Phe Xaa Ser
5 10 15

Leu Gln Tyr Tyr Glu Ile Ser Leu Gln Glu Lys Leu Leu Gly Phe Leu
20 25 30

Trp Leu Cys Phe Leu Ser Tyr Phe Phe Arg Ala Val Tyr Phe Leu Ile
35 40 45

Asp Phe Ser Ser Phe Thr
50 55



<210> 409 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -45..-1

<400> 409

Met His Ser Leu Phe Ile Ala Ser Leu Lys Val Leu Phe Tyr Tyr Ser
-45 -40 -35 -30

Phe Ser Phe Arg Phe Asn Trp Phe Asp Cys Leu Leu His Asn Leu Gly
-25 -20 -15

Glu Asn Phe Leu Ser Leu Leu Ser Lys Ser Cys Ser Ala Asp Pro Ser
-10 -5 1

Gly Ser Thr Phe Met Arg Asp Ile Glu Thr Asn Lys
5 10 15



<210> 410
<211> 39
<212> PRT
<213> Homo sapiens
<220>
<221> SIGNAL
<222> -22..-1

<400> 410

Met Pro Glu Ala Val Glu Gln Ser Ala His Leu Phe Val Thr Trp Ser
-20 -15 -10

Ser Gln Arg Ala Leu Ser His Pro Ala Pro Phe Leu Thr Xaa Xaa Lys
-5 1 5 10

Asn Pro Phe Leu Trp Lys Leu
15



<210> 411 

<211> 51 

<212> PRT 

<213> Homo sapiens 



<220> 






<221> SIGNAL 
<222> -23. .-1 

<400> 411 

Met Ala Phe Gln Ser Leu Leu Glu Met Lys Phe Phe Leu Cys Ala Ala
-20 -15 -10

Phe Pro Leu Gly Ala Gly Val Lys Met Phe His Tyr Leu Gly Pro Gly
-5 1 5

Lys Pro Leu Xaa Gln Ala Ser Pro Ser Pro His Pro His Arg Xaa Arg
10 15 20 25

Ile Trp Pro



<210> 412 

<211> 95 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -48..-1

<400> 412

Met Ala Ser Ser His Trp Asn Glu Thr Thr Thr Ser Val Tyr Gln Tyr
-45 -40 -35

Leu Gly Phe Gln Val Gln Lys Ile Tyr Pro Phe His Asp Asn Trp Asn
-30 -25 -20

Thr Ala Cys Phe Val Ile Leu Leu Leu Phe Ile Phe Thr Val Val Ser
-15 -10 -5

Leu Val Val Leu Ala Phe Leu Tyr Glu Val Leu Xaa Xaa Cys Cys Cys
1 5 10 15

Val Lys Asn Lys Thr Val Lys Asp Leu Lys Ser Glu Pro Asn Pro Leu
20 25 30

Xaa Xaa Met Met Asp Asn Ile Arg Lys Arg Glu Thr Glu Val Val
35 40 45



<210> 413 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -32. .-1 

<400> 413 

Met Asp Glu Tyr Ser Trp Trp Cys His Val Leu Glu Val Val Lys Gly
-30 -25 -20

Gln Met Phe Thr Phe Ile Asn Ile Thr Leu Trp Leu Gly Ser Leu Cys
-15 -10 -5

Gln Arg Phe Phe Tyr Ala Ser Gly Thr Tyr Phe Leu Ile Tyr Ile Ser
1 5 10 15

Thr Val Thr Pro Ser Trp Arg Leu Cys Leu Val Ser
20 25






<210> 414 
<211> 170 
<212> PRT 
<213> Homo sapiens 



<220> 


























<221> SIGNAL 






















<222> -79..-1






















<400> 414 
























Met 


Glu 


Asp 


Pro 


Asn Pro Glu 


Glu 


Asn 


Met 


Lys 


Gin 


Gin 


Asp 


Ser 


Pro 










-75 






-70 










-65 




Lys 


Glu 


Arg 


Ser 


Pro Gin Ser 


Pro Gly Gly 


Asn 


He 


Cys 


His 


Leu Gly 








-60 






-55 










-50 






Ala 


Pro 


Lys 


Cys 


Thr Arg Cys 


Leu 


He 


Thr 


Phe 


Ala 


Asp 


Ser 


Lys 


Phe 






-45 






-40 










-35 






Gin 


Glu 


Arg 


His 


Met Lys Arg 


Glu 


His 


Pro 


Ala 


Asp 


Phe 


Val 


Ala 


Gin 




-30 






-25 










-20 












Leu 


Gin 


Gly Val Leu Phe 


He 


Cys 


Phe 


Thr 


Cys Ala 


Arg 


Ser 


Phe 










-10 








-5 










1 




Ser 


Ser 


Lys 


Ala Xaa Xaa 


Thr 


His 


Gin 


Arg 


Ser 


His 


Gly Pro Xaa 


^^^kla 






5 






10 










15 






Lys 


Pro 


Thr 


Leu Pro Val 


Ala 


Thr 


Thr 


Thr 


Ala 


Gin 


Pro 


Thr 


Phe 






20 






25 










30 








4f>ro 


Cys 


Pro 


Asp 


Cys Gly Lys 


Thr 


Phe 


Gly 


Gin 


Ala 


Val 


Ser 


Leu 


Xaa 




35 






40 










45 










|j%rg His 


Xaa 


Gin 


Xaa His Glu 


Val 


Arg 


Ala 


Pro 


Pro 


Gly 


Thr 


Phe 


Ala 










55 








60 








65 




Thr 


Xaa 


Cys Gly Gin Asp Phe Ala Gin 


Glu 


Xaa 


Gly 


Leu 


His 


Gin 










70 






75 










80 




[|is 


Tyr 


He 


Arg His Ala Arg Gly Gly Leu 















85 90 



<210> 415
<211> 190
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -82..-1

<400> 415
Met Tyr Val Trp Pro Cys Ala Val Val Leu Ala Gln Tyr Leu Trp Phe
-80 -75 -70

His Arg Arg Ser Leu Pro Gly Lys Ala Ile Leu Glu Ile Gly Ala Gly
-65 -60 -55

Val Ser Leu Pro Gly Ile Leu Ala Ala Lys Cys Gly Ala Glu Val Ile
-50 -45 -40 -35

Leu Ser Asp Ser Ser Glu Leu Pro His Cys Leu Glu Val Cys Arg Gln
-30 -25 -20

Ser Cys Gln Met Asn Asn Leu Pro His Leu Gln Val Val Gly Leu Thr
-15 -10 -5

Trp Gly His Ile Ser Trp Asp Leu Leu Ala Leu Pro Pro Gln Asp Ile
1 5 10

Ile Leu Ala Ser Asp Val Phe Phe Glu Pro Glu Xaa Phe Glu Asp Ile
15 20 25 30

Leu Ala Thr Ile Tyr Phe Leu Met His Lys Asn Pro Lys Val Gln Leu
35 40 45

Trp Ser Thr Tyr Gln Val Arg Xaa Ala Asp Trp Ser Leu Glu Ala Leu
50 55 60

Leu Tyr Lys Trp Asp Met Lys Cys Val His Ile Pro Leu Glu Ser Phe
65 70 75

Asp Ala Asp Lys Glu Xaa Ile Ala Glu Ser Thr Leu Pro Gly Arg His
80 85 90

Thr Val Glu Met Leu Val Ile Ser Phe Ala Lys Asp Ser Leu
95 100 105



<210> 416 

<211> 114 

<212> PRT 

<213> Homo sapiens

<220>
<221> SIGNAL
<222> -60..-1

<400> 416
Met Met Ala Ala Val Pro Pro Gly Leu Glu Pro Trp Asn Arg Val Arg
-60 -55 -50 -45

Ile Pro Lys Ala Gly Asn Arg Ser Ala Val Thr Val Gln Asn Pro Gly
-40 -35 -30

Ala Ala Leu Asp Leu Cys Ile Ala Ala Val Ile Lys Glu Cys His Leu
-25 -20 -15

Ser Gln Thr Leu Asp Ala Glu Thr Asp Val
-10 -5 1

Leu Cys Ala Val Leu Tyr Ser Asn His Asn Arg Met Gly Arg His Lys
5 10 15 20

Pro His Leu Ala Leu Lys Gln Val Glu Gln Cys Leu Lys Arg Leu Lys
25 30 35

Asn Met Asn Leu Glu Gly Ser Ile Gln Asp Leu Phe Glu Leu Phe Ser
40 45 50

Ser Lys 



<210> 417
<211> 161
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -108..-1

<400> 417
Met Thr Ser Gly Gln Ala Arg Ala Ser Xaa Gln Ser Pro Gln Ala Leu
-105 -100 -95

Asp Ser Gly Pro Val Asn Ile Ser Val Ser Ile Thr Leu Thr Leu
-90 -85 -80

Asp Pro Leu Lys Pro Phe Gly Gly Tyr Ser Arg Asn Val Thr His Leu
-75 -70 -65

Tyr Ser Thr Ile Leu Gly His Gln Ile Gly Leu Ser Gly Arg Glu Ala
-60 -55 -50 -45

[illegible] Ile Asn Ile Thr Phe Thr Leu Pro Thr Ala Trp Ser Ser
-40 -35 -30

Asp Asp Cys Ala Leu His Gly His Cys Glu Gln Val Val Phe Thr Ala
-25 -20 -15

Cys Met Thr Leu Thr Ala Ser Pro Gly Val Phe Pro Ser Leu Tyr Ser
-10 -5 1

His Arg Thr Val Phe Leu Thr Arg Thr Ala Thr Pro Arg Ser Gly Thr
5 10 15 20

Arg Ser Ser Gln Leu Pro Glu Met Pro Thr Gln Asn Thr Pro Lys Ile
25 30 35

Thr Ile Leu Ser Gly Val Ile Arg Gly Pro Leu Glu Lys Ser Ile Met
40 45 50

Leu



<210> 418
<211> 67
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -21..-1

<400> 418
Met Leu Gly Gly Asp His Arg Ala Leu Leu Leu Lys Ile Trp Leu Leu
-20 -15 -10

Gln Arg Pro Glu Ser Gln Glu Gly Leu Leu Pro Gly Arg Leu Val Val
-5 1 5 10

Met Glu Arg Arg Val Lys Asn Asp Leu Met Ser Phe Leu Ser Thr Val
15 20 25

Leu Leu Ser Phe His Ser Ser Asn Ala Arg Val Ser His Cys Glu Pro 

30 35 40 

Leu Arg Met 
45 



<210> 419
<211> 332
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -32..-1

<400> 419
Met Ile Xaa Leu Arg Asp Thr Ala Ala Ser Leu Arg Leu Glu Arg Asp
-30 -25 -20

Thr Arg Gln Leu Pro Leu Leu Thr Ser Ala Leu His Gly Leu Gln Gln
-15 -10 -5

Gln His Pro Ala Phe Ser Gly Val Ala Arg Leu Ala Lys Arg Trp Val
1 5 10 15

Arg Ala Gln Leu Leu Gly Glu Gly Phe Ala Asp Glu Ser Leu Asp Leu
20 25 30

Val Ala Ala Ala Leu Phe Leu His Pro Glu Pro Phe Thr Pro Pro Ser
35 40 45

Ser Pro Gln Val Gly Phe Leu Arg Phe Leu Phe Leu Val Ser Thr Phe
50 55 60

Asp Trp Lys Asn Asn Pro Leu Phe Val Asn Leu Asn Asn Glu Leu Thr
65 70 75 80

Val Glu Glu Gln Val Glu Ile Arg Ser Gly Phe Leu Ala Ala Arg Ala
85 90 95

Gln Leu Pro Val Met Val Ile Val Thr Pro Gln Xaa Arg Lys Asn Ser
100 105 110

Val Trp Thr Gln Asp Gly Pro Ser Ala Gln Ile Leu Gln Gln Leu Val
115 120 125

Val Leu Ala Ala Glu Xaa Leu Pro Met Leu Xaa Xaa Gln Leu Met Asp
130 135 140

Pro Arg Gly Pro Gly Asp Ile Arg Thr Xaa Phe Arg Pro Pro Leu Asp
150 155 160

Ile Tyr Asp Val Leu Ile Arg Leu Ser Pro Arg His Ile Pro Arg His
165 170 175

Arg Gln Ala Val Asp Ser Pro Ala Ala Ser Phe Cys Arg Gly Leu Leu
180 185 190

Ser Gln Pro Gly Pro Ser Ser Leu Met Pro Val Leu Gly Xaa Asp Pro
195 200 205

Pro Gln Leu Tyr Leu Thr Gln Leu Xaa Glu Ala Phe Gly Asp Leu Ala
210 215 220

Leu Phe Phe Tyr Asp Gln His Gly Gly Glu Val Ile Gly Val Leu Trp
225 230 235 240

[illegible] Pro Thr Ser Phe Gln Pro Gln Pro Phe Lys Ala Ser Ser Thr Lys
245 250 255

Gly Arg Met Val Met Ser Arg Gly Gly Glu Leu Val Met Val Pro Asn
260 265 270

Val Glu Ala Ile Leu Glu Asp Phe Ala Val Leu Gly Glu Gly Leu Val
275 280 285

Gln Thr Val Glu Ala Arg Ser Glu Arg Trp Thr Val
290 295 300



<210> 420 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19..-1

<400> 420 

Met Gly Gly Ile Trp Asn Ala Leu Ser Met Ser Ser Phe Ser Phe His
-15 -10 -5

Ser Ser Ser Cys Ser Ala Leu Ser Ala Lys Ser Leu Leu Ser Arg His
1 5 10

His Ile Leu Gln Gln Phe Leu Val Arg Lys Ser Val Pro Leu Glu Asn
15 20 25

Ala Ser Leu Pro Phe Pro His Leu Gly Ser Ser Leu Phe Lys Ile Val
30 35 40 45

Gly



<210> 421
<211> 57
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 421
Met Pro Thr Gly Lys Gln Leu Ala Asp Ile Gly Tyr Lys Thr Phe Ser
-30 -25 -20 -15

Thr Ser Met Met Leu Leu Thr Val Tyr Gly Gly Tyr Leu Cys Ser Val
-5 1

Arg Val Tyr His Tyr Phe Gln Trp Arg Arg Ala Gln Arg Gln Ala Ala
5 10 15

Glu Glu Gln Lys Xaa Ser Gly Ile Met
20 25



<210> 422
<211> 85
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -17..-1

<400> 422
Met Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val
-15 -10 -5

Gly Phe Pro Val Ser Gln Asp Gln Glu Arg Glu Lys Arg Ser Ile Ser
1 5 10 15

Asp Ser Asp Glu Leu Ala Ser Gly Xaa Phe Val Phe Pro Tyr Pro Tyr
20 25 30

Pro Phe Arg Pro Leu Pro Pro Ile Pro Phe Pro Arg Phe Pro Trp Phe
35 40 45

Arg Arg Asn Phe Pro Ile Pro Ile Pro Glu Ser Ala Pro Thr Thr Pro
50 55 60

Leu Pro Ser Glu Lys
65



<210> 423
<211> 85
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -17..-1

<400> 423
Met Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val
-15 -10 -5

Gly Phe Pro Val Ser Gln Asp Xaa Glu Arg Glu Lys Arg Ser Ile Ser
1 5 10 15

Asp Ser Asp Glu Leu Ala Ser Gly Phe Phe Val Phe Pro Tyr Pro Tyr
20 25 30

Pro Phe Arg Pro Leu Pro Pro Ile Pro Phe Pro Arg Phe Pro Trp Phe
35 40 45

Arg Arg Asn Phe Pro Ile Pro Ile Pro Glu Ser Ala Pro Thr Thr Pro
50 55 60

Leu Pro Ser Glu Lys
65



<210> 424
<211> 69
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -29..-1

<400> 424
Met Thr Cys Arg Gly Ser Cys Ser Tyr Ala Thr Arg Arg Ser Pro Ser
-25 -20 -15

Glu Leu Ser Leu Leu Pro Ser Ser Leu Trp Val Leu Ala Thr Ser Ser
-10 -5 1

Pro Thr Ile Thr Ile Ala Leu Ala Met Ala Ala Gly Asn Leu Cys Pro
5 10 15

Leu Pro Ser Ser Xaa Arg Xaa Lys Arg Arg Trp Cys Gln Ala Xaa Gln
20 25 30 35

Gln Xaa Ala Leu Leu
40 



<210> 425
<211> 122
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -56..-1

<400> 425
Met Val Pro Trp Pro Arg Gly Lys Val Lys Thr Ala Pro Ile Pro Ile
-55 -50 -45

Arg [illegible] Pro Phe Leu Pro Thr His Asp Pro Pro Thr Pro Ala His
-40 -35 -30 -25

Trp Ser Pro Ala Ser His Gln Gln Phe Lys His Xaa Ser Pro Leu Leu
-20 -15 -10

[illegible] Leu [illegible] Leu Leu Gly Gln Cys Ser Leu Phe Xaa Asn Leu Arg Lys
-5 1 5

Lys Leu Ala Gly Gln Lys Ala Lys Lys Leu Pro Ser Phe Ser Ser Leu
10 15 20

Pro Leu Thr Leu Trp Pro Leu Thr Pro Gln Phe Ala Glu Leu Thr Thr
25 30 35 40

Val Ala Gln Lys Lys Leu Arg Trp Ser Gly Thr Leu Gly Trp Gly Pro
45 50 55

Val Pro Ser Trp Val Gln Phe Phe Leu Gly
60 65



<210> 426 

<211> 41 

<212> PRT 

<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 426
Met Ala Cys Glu Thr His Gly Val Leu Val Pro Ala His Leu Ser Gly
-30 -25 -20 -15

Leu Ile Thr Cys Leu Leu Ala Phe Trp Val Pro Ala Ser Cys Ile Gln
-10 -5 1

Arg Cys Ser Gly Ser Pro Leu Pro Leu
5 10 



<210> 427
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36..-1

<400> 427 

Met Ala Pro His Thr Ala Ser Phe Gly Val Cys Pro Leu Leu Ser Val 

-35 -30 -25 

Thr Arg Val Val Ala Thr Glu His Trp Leu Phe Leu Ala Ser Leu Ser 
-20 -15 -10 -5 

Gly Ile Lys Thr Tyr Gln Ser Tyr Ile Ser Val Phe Cys Lys Val Thr
1 5 10

Leu Ile

<210> 428
<211> 136
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -18..-1

<400> 428
Met Asp Ser Leu Arg Lys Met Leu Ile Ser Val Ala Met Leu Gly Ala
-15 -10 -5

Xaa Ala Gly Val Gly Tyr Ala Leu Leu Val Ile Val Thr Pro Gly Glu
1 5 10

Arg Arg Lys Gln Glu Met Leu Lys Glu Met Pro Leu Gln Asp Pro Arg
15 20 25 30

Ser Arg Glu Glu Ala Ala Arg Thr Gln Gln Leu Leu Leu Ala Thr Leu
35 40 45

Gln Glu Ala Ala Thr Thr Gln Glu Asn Val Ala Trp Arg Lys Asn Trp
50 55 60

Met Val Gly Gly Glu Gly Gly Ala Thr Gly Xaa His Arg Glu Thr Gly
70 75

Leu Ala Ser Val Gly Ala Gly Pro Trp Leu Gly Arg Arg Asn Pro Arg
80 85 90

Gln Leu Ser Pro Ser Trp Ala Xaa Arg Lys Ile Arg Xaa Glu Asn Xaa
95 100 105 110

Met Pro Gly Leu Ser Gly Val Leu
115



<210> 429
<211> 194
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -65..-1

<400> 429
Met Gln Asp Ala Pro Leu Ser Cys Leu Ser Pro Thr Lys Trp Ser Ser
-65 -60 -55 -50

Val Ser Ser Ala Asp Ser Thr Glu Lys Ser Ala Ser Ala Ala Gly Thr
-45 -40 -35

Arg Asn Leu Pro Phe Gln Phe Cys Leu Arg Gln Ala Leu Arg Met Lys
-30 -25 -20

Ala Ala Gly Ile Leu Thr Leu Ile Gly Cys Leu Val Thr Gly Val Glu
-15 -10 -5

Ser Lys Ile Tyr Thr Arg Cys Lys Leu Ala Lys Ile Phe Ser Arg Ala
1 5 10 15

Gly Leu Asp Asn Xaa Arg Gly Phe Ser Leu Gly Asn Trp Ile Cys Met
20 25 30

Ala Tyr Tyr Glu Ser Gly Tyr Asn Thr Thr Ala Gln Thr Val Leu Asp
35 40 45

Asp Gly Ser Ile Asp Tyr Gly Ile Phe Gln Ile Asn Ser Phe Ala Trp
50 55 60

Cys Arg Arg Gly Lys Leu Lys Glu Asn Asn His Cys His Val Ala Cys
65 70 75

Ser Ala Leu Xaa Thr Asp Asp Leu Thr Asp Ala Ile Ile Cys Ala Xaa
80 85 90 95

Lys Ile Val Lys Glu Thr Gln Gly Met Asn Tyr Trp Gln Gly Trp Lys
100 105 110

Lys His Cys Glu Gly Arg Asp Leu Ser Xaa Trp Lys Lys Gly Cys Glu
115 120 125

Val Ser



<210> 430
<211> 141
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -69..-1

<400> 430
Met Thr Ser Gln Pro Val Pro Asn Glu Thr Ile Ile Val Leu Pro Ser
-65 -60 -55

Asn Val Ile Asn Phe Ser Gln Ala Glu Lys Pro Glu Pro Thr Asn Gln
-45 -40

Gly Gln Asp Ser Leu Lys Lys His Leu His Ala Glu Ile Lys Val Ile
-35 -30 -25

Gly Thr Ile Gln Ile Leu Cys Gly Met Met Val Leu Ser Leu Gly Ile
-15 -10

Ile Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gln Val Thr Ser
-5 1 5 10

Thr Leu Leu Asn Ser Ala Tyr Pro Phe Ile Gly Pro Phe Phe Val Xaa
20 25

[illegible] Xaa Ser Glu Glu Gly Arg Met Gly Gln Xaa Gly Glu Glu Xaa Xaa
30 35 40

Asn Ser Leu Asn Phe Pro Xaa Ala Ser Leu Leu Xaa Leu Ile Cys Gln
45 50 55

Xaa Gln Gly Phe Asn Gly Glu Ser Cys Ser Pro Val Gly
60



<210> 431
<211> 248
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -69..-1

<400> 431
Met Thr Ser Gln Pro Val Pro Asn Glu Thr Ile Ile Val Leu Pro Ser
-65 -60 -55

Asn Val Ile Asn Phe Ser Gln Ala Glu Lys Pro Glu Pro Thr Asn Gln
-50 -45 -40

Gly Gln Asp Ser Leu Lys Lys His Leu His Ala Glu Xaa Lys Val Ile
-35 -30 -25

Gly Thr Ile Gln Ile Leu Cys Gly Met Met Val Leu Ser Leu Gly Ile
-20 -15 -10

Ile Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gln Val Thr Ser
-5 1 5 10

Thr Leu Leu Asn Ser Ala Tyr Pro Phe Ile Gly Pro Phe Phe Phe Ile
15 20 25

Ile Ser Gly Ser Leu Ser Ile Ala Thr Lys Lys Arg Leu Thr Asn Leu
30 35 40

Leu Val His Thr Thr Leu Val Gly Ser Ile Leu Ser Ala Leu Ser Ala
45 50 55

Leu Val Gly Phe Ile Xaa Leu Ser Val Lys Gln Ala Thr Leu Asn Pro
60 65 70 75

Ala Ser Leu Xaa Cys Glu Leu Xaa Lys Asn Asn Ile Pro Thr Xaa Xaa
80 85 90

Tyr Val Xaa Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Xaa Tyr
95 100 105

Thr Ala Lys Ala Xaa Leu Ala Gly Thr Leu Ser Leu Met Leu Ile Cys
110 115 120

Thr Leu Leu Glu Phe Cys Xaa Xaa Val Leu Thr Ala Val Leu Arg Trp
125 130 135

[illegible] Gln Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro Xaa
140 145 150 155

Ser Tyr Ile Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys
160 165 170

Gly Tyr Glu Glu Leu Leu Thr Ser
175



<210> 432
<211> 49
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -36..-1

<400> 432
Met Gln Val Pro His Leu Arg Val Trp Thr Gln Val Xaa Asp Thr Phe
-35 -30 -25

Ile Gly Tyr Arg Asn Leu Gly Phe Thr Ser Met Cys Ile Leu Phe His
-20 -15 -10 -5

Cys Leu Leu Ser Phe Gln Val Phe Lys Lys Lys Arg Lys Leu Xaa Leu
1 5 10

Phe 



<210> 433
<211> 86
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -14..-1

<400> 433
Met Val Ala Leu Asn Leu Ile Leu Val Pro Cys Cys Ala Ala Trp Cys
-10 -5 1

Asp Pro Arg Arg Ile His Ser Gln Asp Asp Val Leu Arg Ser Ser Ala
5 10 15

Ala Asp Thr Gly Ser Ala Met Gln Arg Arg Glu Ala Trp Ala Gly Trp
20 25 30

Arg Arg Ser Gln Pro Phe Ser Val Gly Leu Pro Ser Ala Glu Arg Leu
35 40 45 50

Glu Asn Gln Pro Gly Lys Leu Ser Trp Arg Ser Leu Val Gly Glu Gly
55 60 65

His Arg Ile Cys Asp Leu
70



<210> 434 

<211> 144 
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -58..-1

<400> 434
Met Thr Arg Leu Cys Leu Pro Arg Pro Glu Ala Arg Glu Asp Pro Ile
-55 -50 -45

Pro Val Pro Pro Arg Gly Leu Gly Ala Gly Glu Gly Ser Gly Ser Pro
-40 -35 -30

Val Arg Pro Pro Val Ser Thr Trp Gly Pro Ser Trp Ala Gln Leu Leu
-25 -20 -15

Asp Ser Val Leu Trp Leu Gly Ala Leu Gly Leu Thr Ile Gln Ala Val
-10 -5 1 5

Phe Ser Thr Thr Gly Pro Ala Leu Leu Leu Leu Leu Val Ser Phe Leu
10 15 20

Thr Phe Asp Leu Leu His Arg Pro Ala Val Thr Leu Cys His Ser Ala
25 30 35

Asn Phe Ser Pro Gly Ala Arg Val Arg Gly Pro Val Lys Val Leu Asp
40 45 50

Ser Arg Arg Leu Tyr Ser Cys Lys Trp Val Gln Ser Gln Asp Asn Leu
55 60 65 70

Ala Ser Arg Lys His Cys Cys Cys Cys Ser Trp Gly Trp Ala Arg Ser 
75 80 85 



<210> 435
<211> 121
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<400> 435
Met Glu Arg Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala
-15 -10 -5

Ser Ala Gly Cys Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tyr Gln
1 5 10 15

Cys Phe Lys Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser
20 25 30

Pro Leu Val Cys Ile Ser Asn Glu Val Val Val Ser Phe Ser
35 40 45

Glu Ser Pro Pro Gly Arg Gly Xaa Val Pro Xaa Ala Gly Glu Xaa Pro
50 55 60

Val Pro Pro Pro Leu Xaa Asp Leu Xaa Met Thr Pro Arg Xaa Xaa Arg
65 70 75 80

Ala Trp Gly Pro Val Gly Pro Lys Val Pro Pro Ala Val Ser Pro Ala
85 90 95

Leu Gly Ser Gly Glu His Pro Xaa Xaa
100 105















<210> 436
<211> 162
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -16..-1

<400> 436
Met Glu Arg Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala
-15 -10 -5

Ser Ala Gly Cys Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tyr Gln
5 10 15

Cys Phe Lys Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser
20 25 30

Pro Leu Asp Gln Val Cys Ile Ser Asn Glu Val Val Val Ser Phe Lys
35 40 45

Trp Ser Val Arg Val Leu Leu Ser Lys Arg Cys Ala Pro Arg Cys Pro
50 55 60

Asn Asp Asn Met Xaa Phe Glu Trp Ser Pro Ala Pro Met Val Gln Gly
65 70 75 80

Val Ile Thr Arg Arg Cys Cys Ser Trp Ala Leu Cys Asn Arg Ala Leu
85 90 95

Thr Pro Gln Glu Gly Arg Trp Ala Leu Xaa Gly Gly Leu Leu Leu Gln
100 105 110

Asp Pro Ser Arg Gly Xaa Lys Thr Trp Val Arg Pro Gln Leu Gly Leu
115 120 125

Pro Leu Cys Leu Pro Xaa Ser Asn Pro Leu Cys Pro Xaa Glu Thr Gln

130 135 140 

Glu Gly 
145 






<210> 437 
<211> 110 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -20..-1

<400> 437 

Met Xaa Leu Met Val Leu Val Phe Thr Ile Gly Leu Thr Leu Leu Leu
-20 -15 -10 -5

Gly Xaa Gln Ala Met Pro Ala Asn Arg Leu Ser Cys Tyr Arg Lys Ile
1 5 10

Leu Lys Asp His Asn Cys His Asn Leu Pro Glu Gly Val Ala Asp Leu
15 20 25

Thr Gln Ile Asp Val Asn Val Gln Asp His Phe Trp Asp Gly Lys Gly
30 35 40

Cys Glu Met Ile Cys Tyr Cys Asn Phe Lys Arg Ile Ala Leu Leu Pro
45 50 55 60

Lys Arg Arg Phe Leu Trp Thr Lys Asp Leu Phe Arg Asp Ser Leu Gln
65 70 75

Gln Ser Met Arg Ile Phe Met Tyr Ser Gly Glu His His Ser
80 85 90



<210> 438
<211> 71
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<400> 438
Met Lys Leu Leu Thr His Asn Leu Leu Ser Ser His Val Arg Gly Val
-15 -10 -5 1

Gly Ser Arg Gly Phe Pro Leu Arg Leu Gln Ala Thr Glu Val Arg Ile
5 10 15

Cys Pro Val Glu Phe Asn Pro Asn Phe Val Ala Arg Met Ile Pro Lys
20 25 30

Val Glu Trp Ser Ala Phe Leu Glu Ala Xaa Asp Asn Leu Arg Leu Ile
35 40 45

Gln Val Pro Arg Arg Ala Gly
50 55 



<210> 439
<211> 99
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -24..-1

<400> 439
Met Lys Ser Ala Lys Leu Gly Phe Leu Leu Arg Phe Phe Ile Phe Cys
-20 -15 -10

Ser Leu Asn Thr Leu Leu Leu Gly Gly Val Asn Lys Ile Ala Glu Lys
-5 1 5

Ile Cys Gly Asp Leu Lys Asp Pro Cys Lys Leu Asp Met Asn Phe Gly
10 15 20

Ser Cys Tyr Glu Val His Phe Arg Tyr Phe Tyr Asn Arg Thr Ser Lys
25 30 35 40

Arg Cys Glu Thr Phe Val Phe Ser Ser Cys Asn Gly Asn Leu Asn Asn
45 50 55

Phe Lys Leu Lys Ile Glu Arg Glu Val Xaa Cys Val Ala Lys Tyr Lys
60 65 70

Pro Pro Arg
75



<210> 440
<211> 169
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -25..-1

<400> 440
Met Arg Lys Pro Ala Ala Gly Phe Leu Pro Ser Leu Leu Lys Val Leu
-25 -20 -15 -10

Leu Pro Leu Ala Pro Ala Ala Ala Gln Asp Ser Thr Gln Ala Ser
-5 1 5

Thr Pro Gly Ser Pro Leu Ser Pro Thr Glu Tyr Gln Arg Phe Phe Ala
10 15 20

Leu Leu Thr Pro Thr Trp Lys Ala Glu Thr Thr Cys Arg Leu Arg Ala
25 30 35

Thr His Gly Cys Arg Asn Pro Thr Leu Val Gln Leu Asp Gln Tyr Glu
40 45 50 55

Asn His Gly Leu Val Pro Asp Gly Ala Val Cys Ser Asn Leu Pro Tyr
60 65 70

Ala Ser Trp Phe Glu Ser Phe Cys Gln Phe Thr His Tyr Arg Cys Ser
75 80 85

Asn His Val Tyr Tyr Ala Lys Arg Val Leu Cys Ser Gln Pro Val Ser
90 95 100

Ile Leu Ser Pro Asn Thr Leu Lys Glu Ile Glu Xaa Ser Ala Glu Val
105 110 115

Ser Pro Thr Thr Asp Asp Leu Pro His Leu Thr Pro Leu His Ser Asp
120 125 130 135

Arg Thr Pro Asp Leu Pro Ala Leu Ala
140





















<210> 441
<211> 167
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -76..-1

<400> 441
Met Gly Asp Tyr Leu Leu Arg Gly Tyr Arg Met Leu Gly Glu Thr Cys
-75 -70 -65

Ala Asp Cys Gly Thr Ile Leu Leu Gln Asp Lys Gln Arg Lys Ile Tyr
-60 -55 -50 -45

Cys Val Ala Cys Gln Glu Leu Asp Ser Asp Val Asp Lys Asp Asn Pro
-40 -35 -30

Ala Leu Asn Ala Gln Ala Ala Leu Ser Gln Ala Arg Glu His Gln Leu
-25 -20 -15

Ala Ser Ala Ser Glu Leu Pro Leu Gly Ser Arg Pro Ala Pro Gln Pro
-10 -5 1

Pro Val Pro Arg Pro Glu His Cys Glu Gly Ala Ala Ala Gly Leu Lys
5 10 15 20

Ala Ala Gln Gly Pro Pro Ala Pro Ala Val Pro Pro Asn Thr Xaa Val
25 30 35

Met Ala Cys Thr Gln Thr Ala Leu Leu Gln Lys Leu Thr Trp Ala Ser
40 45 50

Ala Glu Leu Gly Ser Xaa Thr Ser Xaa Gly Lys Xaa Ala Ser Ser Cys
55 60 65

Val Ala Leu Ser Ala His Val Arg Arg Pro Cys Ala Ala Cys Ser Ser
70 75 80

Ser Thr Lys Arg Ser Pro
85 90



<210> 442
<211> 70
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -15..-1

<400> 442
Met Ile Leu Cys Phe Leu Leu Pro His His Arg Leu Gln Glu Ala Arg
-15 -10 -5 1

Gln Ile Gln Val Leu Lys Met Leu Pro Arg Glu Lys Leu Arg Arg Arg
5 10 15

Glu Glu Arg Lys Gln Ile Asn Gly Lys Lys Xaa Arg Thr Lys Tyr Glu

20 25 30 

Thr Pro Arg Lys Xaa Xaa Gly Lys Lys Gly Gly Asn Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Leu Ser Lys Arg Asp 
50 55 



<210> 443
<211> 381
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -33..-1

<400> 443
Met Ser Trp Thr Val Pro Val Val Arg Ala Ser Gln Arg Val Ser Ser
-30 -25 -20

Val Gly Ala Asn Xaa Leu Cys Leu Gly Met Ala Leu Cys Pro Arg Gln
-15 -10 -5

Ala Thr Arg Ile Pro Leu Asn Gly Thr Trp Leu Phe Thr Pro Val Ser
1 5 10 15

Lys Met Ala Thr Val Lys Ser Glu Leu Ile Glu Arg Phe Thr Ser Glu
20 25 30

Lys Pro Val His His Ser Lys Val Ser Ile Ile Gly Thr Gly Ser Val
35 40 45

Gly Met Ala Cys Ala Ile Ser Ile Leu Leu Lys Gly Leu Ser Asp Glu
50 55 60

Leu Ala Leu Val Asp Leu Asp Glu Xaa Lys Leu Lys Gly Glu Thr Met
65 70 75

Asp Leu Gln His Gly Ser Pro Phe Thr Lys Met Pro Asn Ile Val Cys
80 85 90 95

Ser Lys Xaa Tyr Phe Val Thr Ala Asn Ser Asn Leu Val Ile Ile Thr
100 105 110

Ala Gly Ala Arg Gln Xaa Lys Gly Glu Thr Arg Leu Asn Leu Xaa Gln
115 120 125

Arg Asn Val Ala Ile Phe Lys Leu Met Ile Ser Ser Ile Val Gln Tyr
130 135 140

Ser Pro His Cys Lys Leu Ile Ile Val Ser Asn Pro Val Asp Ile Leu
145 150 155

Thr Tyr Val Ala Trp Lys Leu Ser Ala Phe Pro Lys Asn Arg Ile Ile
160 165 170 175

Gly Ser Gly Cys Asn Leu Ile Xaa Ala Arg Phe Arg Phe Leu Ile Gly
180 185 190

Gln Lys Leu Gly Ile His Ser Glu Ser Cys His Gly Trp Ile Leu Gly
195 200 205

Glu His Gly Asp Ser Ser Val Pro Val Trp Ser Gly Val Asn Ile Ala
210 215 220

Gly Val Pro Leu Lys Asp Leu Asn Ser Asp Ile Gly Thr Asp Lys Asp
225 230 235

Pro Glu Gln Trp Lys Asn Val His Lys Glu Val Thr Ala Thr Ala Tyr
240 245 250 255

Glu Ile Ile Lys Met Lys Gly Tyr Thr Ser Trp Ala Ile Gly Leu Ser
260 265 270

Val Ala Asp Leu Thr Glu Ser Ile Leu Lys Asn Leu Arg Arg Ile His
275 280 285

Pro Val Ser Thr Ile Thr Lys Gly Leu Tyr Gly Ile Xaa Glu Glu Val
290 295 300

Phe Leu Ser Ile Pro Cys Ile Leu Gly Glu Asn Gly Ile Thr Asn Leu
305 310 315

Ile Lys Ile Lys Leu Thr Pro Glu Glu Glu Ala His Leu Lys Lys Ser
320 325 330 335

Ala Lys Thr Leu Trp Glu Ile Gln Asn Lys Leu Lys Leu
340 345 






<210> 444 
<211> 39 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14..-1

<400> 444 

Met Tyr Tyr Met Val Cys Leu Phe Phe Arg Leu Ile Phe Ser Glu His

-10 -5 1

Leu Pro Ile Ile Gly Thr Val Thr Ser His Lys Thr Gly Thr Leu Thr

5 10 15 

Val Tyr Pro Thr Ser Ala Gly 
20 25 



<210> 445
<211> 50
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -37..-1

<400> 445
Met Val Leu Thr Thr Leu Pro Leu Pro Ser Ala Asn Ser Pro Val Asn
-35 -30 -25

Met Pro Thr Thr Gly Pro Asn Ser Leu Ser Tyr Ala Ser Ser Ala Leu
-20 -15 -10

Ser Pro Cys Leu Thr Ala Pro Lys Ser Pro Arg Leu Ala Met Met Pro
5 10

Asp Asn



<210> 446 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26..-1

<400> 446 

Met Thr Pro Trp Cys Leu Ala Cys Leu Gly Arg Arg Pro Leu Ala Ser 

-25 -20 -15 

Leu Gln Trp Ser Leu Thr Leu Ala Trp Cys Gly Ser Gly Ser His Trp
-10 -5 1 5

Thr Glu Arg Pro Xaa Gln Xaa Ser Pro Trp Xaa Ser Leu Ser Ala Thr
10

Thr Arg Gly
25



<210> 447
<211> 242
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -30..-1

<400> 447
Met Gly Glu Ala Ser Pro Pro Ala Pro Ala Arg Arg His Leu Leu Val
-30 -25 -20 -15

Leu Leu Leu Leu Leu Ser Thr Leu Val Ile Pro Ser Ala Ala Ala Pro
-10 -5 1

Ile His Asp Ala Asp Ala Gln Glu Ser Ser Leu Gly Leu Thr Gly Leu
5 10 15

Gln Ser Leu Leu Gln Gly Phe Ser Arg Leu Phe Leu Lys Gly Asn Leu
20 25 30

Leu Arg Gly Ile Asp Ser Leu Phe Ser Ala Pro Met Asp Phe Arg Gly
40 45 50

Leu Pro Gly Asn Tyr His Lys Glu Glu Asn Gln Glu His Gln Leu Gly
55 60 65

Asn Asn Thr Leu Ser Ser His Leu Gln Ile Asp Lys Met Thr Asp Asn
70 75 80

Lys Thr Gly Glu Val Leu Ile Ser Glu Asn Val Val Ala Ser Ile Gln
85 90 95

Pro Xaa Glu Gly Xaa Phe Glu Gly Asp Leu Lys Val Pro Arg Met Glu
100 105 110

[illegible] Lys Glu Ala Leu Val Pro Xaa Gln Lys Ala Thr Asp Ser Phe His
115 120 125 130

Thr Glu Leu His Pro Arg Val Ala Phe Trp Ile Ile Lys Leu Pro Arg
135 140 145

Arg Arg Ser His Gln Asp Ala Leu Glu Gly Gly His Trp Leu Xaa Glu
150 155 160

Lys Arg His Arg Leu Gln Ala Ile Arg Asp Gly Leu Arg Lys Gly Thr
165 170 175

His Lys Asp Xaa Leu Xaa Xaa Gly Thr Glu Ser Ser Ser His Ser Arg
180 185 190

Leu Ser Pro Arg Lys Xaa His Leu Leu Tyr Ile Leu Xaa Pro Ser Arg
195 200 205 210

Gln Leu

<210> 448
<211> 154
<212> PRT
<213> Homo sapiens

<220>
<221> SIGNAL
<222> -60..-1

<400> 448
Met Gly Ser Lys Cys Cys Lys Gly Gly Pro Asp Glu Asp Ala Val Glu
-60 -55 -50 -45

Arg Gln Arg Arg [illegible] Lys Leu Leu Leu Ala Gln Leu His His Arg Lys
-40 -35 -30

Arg Val Lys Ala Ala Gly Gln Ile Gln Ala Trp Trp Arg Gly Val Leu
-25 -20 -15

Val Arg Arg Thr Leu Leu Val Ala Ala Leu Arg Ala Trp Met Ile Gln
-10 -5 1

Cys Trp Trp Arg Thr Leu Val Gln Arg Arg Ile Arg Gln Arg Arg Gln
5 10 15 20

Ala Leu Leu Gly Val Tyr Val Ile Gln Glu Gln Ala Ala Val Lys Leu
25 30 35

Gln Ser Cys Ile Arg Met Trp Gln Cys Arg Gln Cys Tyr Arg Gln Met
40 45 50

Cys Asn Ala Leu Cys Leu Phe Gln Val Pro Lys Ser Ser Leu Ala Phe
55 60 65

Gln Thr Asp Gly Phe Leu Gln Val Gln Tyr Ala Ile Pro Ser Lys Gln
70 75 80

Pro Glu Phe His Ile Glu Ile Leu Ser Ile
85 90



<210> 449 

<211> 89 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -61..-1 

<400> 449 

Met Asn Ala Ala Ile Asn Thr Gly Pro Ala Pro Ala Val Thr Lys Thr 
-60 -55 -50 

Glu Thr Glu Val Gln Asn Pro Asp Val Leu Trp Asp Leu Asp Ile Pro 
-45 -40 -35 -30 

Glu Ala Arg Ser His Ala Asp Gln Asp Ser Asn Pro Lys Ala Glu Ala 

-25 -20 -15 

Leu Leu Pro Cys Asn Leu His Cys Ser Trp Leu His Ser Ser Pro Arg 

-10 -5 1 

Pro Asp Pro His Ser His Phe Pro Ser Xaa Arg Arg Cys Pro Leu Pro 

5 10 15 

His Pro Cys Ala Thr Tyr Pro Pro Xaa 
20 25 



<210> 450 
<211> 73 
<212> PRT 

<213> Homo sapiens 



<220> 






<221> SIGNAL 
<222> -26 . . -1 

<400> 450 
Met Arg Met Ser Leu Ala Gln Arg Val Leu Leu Thr Trp Leu Phe Thr 
-25 -20 -15 

Leu Leu Phe Leu Ile Met Leu Val Leu Lys Leu Asp Glu Lys Ala Pro 
-10 -5 1 5 

Trp Asn Trp Phe Leu Ile Phe Ile Pro Val Trp Ile Phe Asp Thr Ile 
10 15 20 

Leu Leu Val Leu Leu Ile Val Lys Met Ala Gly Arg Cys Lys Ser Gly 
25 30 35 

Phe Asp Leu Asp Met Asp His Thr Ile 
40 45 



<210> 451 

<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -34..-1 

<400> 451 

Met Ile Pro Leu Ile Ser His Leu Ala Glu Ala Ala Pro Pro Thr Ser 
-30 -25 -20 

Trp Ser Leu Ile Ser Ser Val Leu Asn Val Gly His Leu Leu Phe Ser 
-15 -10 -5 

Ser Ala Cys Ser Val Ser Leu Glu Ala Leu Ser Thr Arg Asn Ile Lys 
1 5 10 

Ile Ile Leu Met Lys 
20 



<210> 452 

<211> 121 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -38. .-1 

<400> 452 

Met Glu Ser Pro Gln Leu His Cys Ile Leu Asn Ser Asn Ser Val Ala 

-35 -30 -25 

Cys Ser Phe Ala Val Gly Ala Gly Phe Leu Ala Phe Leu Ser Cys Leu 

-20 -15 -10 

Ala Phe Leu Val Leu Asp Thr Gln Glu Thr Arg Ile Ala Gly Thr Arg 

-5 1 5 10 

Phe Lys Thr Ala Phe Gln Leu Leu Asp Phe Ile Leu Ala Val Leu Trp 

15 20 25 

Ala Val Val Trp Phe Met Gly Phe Cys Phe Leu Ala Asn Gln Trp Gln 






30 35 40 

His Ser Pro Pro Lys Glu Xaa Leu Leu Gly Ser Ser Ser Ala Gln Ala 
45 50 55 

Ala Ile Gly Xaa His Leu Leu Leu His Pro Cys Leu Asp Ile Pro Xaa 
60 65 70 

Leu Pro Gly Xaa Pro Gly Pro Pro Lys 
75 80 



<210> 453 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -37. .-1 



<400> 453 
















Met Ser 


Thr 


Val 


Gly Leu Phe 


His Phe Pro 


Thr Pro Leu 


Thr 


Arg 


He 




-35 






-30 


-25 






Iflys Pro 


Ala 


Pro 


Trp Gly Leu 


Arg Leu Trp 


Glu Lys Leu 


Thr 


Leu 


Leu 


-2 0 






-15 




-10 








^^^er Pro 


Gly 


He 


Ala Val Thr 


Pro Val Gin 


Met Ala Gly 


Lys 


Lys 


Asp 








1 


5 






10 


Utyr Pro 


Ala 


Leu 


Leu Ser Leu 


Asp Glu Asn 


Glu Leu Glu 


Glu 


Gin 


Phe 


hpsil Lys 




15 




20 




25 






Gly 


His 


Gly Pro Gly 


Gly Gin Ala 


Thr Asn Lys 


Thr 


Ser 


Asn 




30 






35 


40 








I Cys Val 


Val 


Leu 


Lys Xaa He 


Pro Ser Gly He Val Val 


Lys 


Cys 


His 


U 45 






50 




55 






l^Qln Thr 


Arg 


Ser 


Val Asp Gin 


Asn Arg Lys 


Leu Ala Arg 


Lys 


He 


Leu 








65 




70 




75 


j'^ln Glu 


Lys 


Val 


Xaa Val Phe 


Tyr Asn Gly Glu Asn Ser 


Pro 


Val 


His 








80 


85 






90 




^Jlys Glu 


Lys 


Arg 


Glu Ala Ala 


Lys Lys Lys 


Gin Glu Arg 


Lys 


Lys 


Arg 






95 




100 




105 






Ala Lys 


Glu 


Thr 


Leu Glu Lys 


Lys Xaa Leu 


Leu Lys Xaa 


Leu Trp Glu 




110 






115 


120 








Ser Ser 


Lys 


Lys 


Val His 












125 



















<210> 454 
<211> 180 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. . -1 

<400> 454 

Met Gly Ile Gln Thr Ser Pro Val Leu Leu Ala Ser Leu Gly Val Gly 
-25 -20 -15 








Val 


Til IT 




J_lC Li. 


Gly Leu 


iixa vax (jjxy oer iyr Lieu val 


Arg Arg 


-1 n 










- 0 




1 


5 






r-LI. y 


It J. ^ 


m n 


Val 


Thr 


j-ieu ijeu Asp Fro Asn (jiu Lys 


Tyr Leu 
















lb 20 








Leu. 


Hieui 




Lys 


Thr 


iiir va± oer rixs Asn iiir iiys 


Arg Phe 
















30 35 










T .^1 1 
J-itr U. 


irlTO 


Thr 


TV 1 — . 

Ala 


Hi s Hi s Thr Leu Gly Leu Pro 


Val Gly 
















50 






X X c 




Leu 


Ser 


Thr 


Arg lie Asp Cjiy ber Leu val 


lie Arg 


D D 










b U 




65 


70 


Puro 




T]i3r 




V CtX 


Thr 


Ser 


Asp valu Asp Qjin tiiy ryr val 


Asp Leu 










/b 






80 


85 


V CtX 




ijy iD 


Va.-L 


Tyr 


Leu 


Lys 


Gly Val His Pro Lys Phe Pro 


Glu Gly 








y u 








95 100 






Lys 




Seir 




Tyr 


Leu 


Asp Xaa Leu Lys Val Gly Asp 


Xaa Val 






105 










110 115 




Glu 


Phe 


Xaa 


Gly 


Pro 


Ser 


Gly 


Leu Leu Thr Tyr Thr Gly Lys 


Gly His 




120 










125 


130 




Phe 


Asn 


He 


Gin 


Pro 


Asn 


Lys 


Asn Leu His Gin Asn Pro Glu 


Trp Arg 


135 










140 




145 


150 


Arg 


Asn 


Trp 


Glu 













<210> 455 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -64..-1 

<400> 455 

Met Thr Pro Arg Ile Leu Ser Glu Val Gln Phe Ser Ala Phe Cys Pro 
-60 -55 -50 

Tyr Trp Thr Ile Ala Arg Ile Leu Glu Arg Val Gly Ser Ala Cys Phe 
-45 -40 -35 

Arg Leu Glu Leu Cys Ala Ala Ile Val Gly Tyr Phe Val Leu Asp Val 

-30 -25 -20 

Arg Thr Phe Leu Phe Ile Val Val Cys Val Ile Cys Val Thr Leu Asn 

-15 -10 -5 

Phe Pro Arg Phe Tyr Phe Leu Cys Leu Ser Ser Leu Thr Ala Phe Gly 
1 5 10 15 

Thr Pro Pro Ile Gly Val His Ile Pro Ser Pro 
20 25 



<210> 456 
<211> 257 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23 . . -1 






<400> 456 
Met Arg Arg Ile Ser Leu Thr Ser Ser Pro Val Arg Leu Leu Leu Xaa 
-20 -15 -10 

Leu Leu Leu Leu Leu Ile Ala Leu Glu Ile Met Val Gly Gly His Ser 
-5 1 5 

Leu Cys Phe Asn Phe Thr Ile Lys Ser Leu Ser Arg Pro Gly Gln Pro 
10 15 20 25 

Trp Cys Glu Ala His Val Phe Leu Asn Lys Asn Leu Phe Leu Gln Tyr 
30 35 40 

Asn Ser Asp Asn Asn Met Val Lys Pro Leu Gly Leu Leu Gly Lys Lys 
45 50 55 

Val Tyr Ala Thr Ser Thr Trp Gly Glu Leu Thr Gln Thr Leu Gly Glu 
60 65 70 

Val Gly Arg Asp Leu Arg Met Leu Leu Cys Asp Ile Lys Pro Gln Ile 
75 80 85 

Lys Thr Ser Asp Pro Ser Thr Leu Gln Val Xaa Xaa Phe Cys Gln Arg 
90 95 100 105 

Glu Ala Glu Arg Cys Thr Gly Ala Ser Trp Gln Phe Ala Thr Asn Gly 
110 115 120 

Glu Lys Ser Leu Leu Phe Asp Ala Met Asn Met Thr Trp Thr Val Ile 
125 130 135 

Asn His Glu Ala Ser Xaa Ile Lys Glu Thr Trp Lys Lys Asp Arg Xaa 
140 145 150 

Leu Glu Xaa Tyr Phe Arg Lys Leu Ser Lys Gly Asp Cys Asp His Trp 
160 165 

Leu Arg Glu Phe Leu Gly His Trp Glu Ala Met Pro Xaa Pro Xaa Val 
170 175 180 185 

Ser Pro Xaa Asn Ala Ser Xaa Ile His Trp Ser Ser Ser Xaa Leu Pro 
190 195 200 

Xaa Xaa Trp Ile Ile Leu Gly Ala Phe Ile Leu Leu Xaa Leu Met Gly 
210 215 

Ile Val Leu Ile Cys Val Trp Trp Gln Asn Gly Xaa Xaa Ser Thr Xaa 
220 225 230 

Xaa 



<210> 457 
<211> 193 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -60..-1 

<400> 457 

Met Cys Pro Ser Leu Glu Glu Ala Pro Ser Val Lys Gly Thr Leu Pro 
-60 -55 -50 -45 

Cys Ser Gly Gln Gln Gln Pro Phe Pro Phe Gly Ala Ser Asn Ile Pro 
-40 -35 -30 

Leu Leu Leu Gly Arg Ser Arg Lys Val Ala Arg Gly Ala Pro Val Leu 
-25 -20 -15 

Trp Pro Phe Leu Thr Trp Ile Asn Pro Ala Leu Ser Ile Cys Asp Pro 
-10 -5 1 

Leu Gly Ser Cys Gly Trp Xaa Cys His Thr Ala Gln Val Pro Ala Pro 
5 10 15 20 

Leu Gln Leu Pro Thr Ala Cys Pro Pro Leu Pro His Gly Thr Arg Ala 
25 30 35 


V CtX 






Tiir 


Pro 


Gly Leu Leu Pro Glu Ala Ala Ala Pro Xaa Thr 


















45 






50 


Xaa Gly Ala Leu Ser Ser Arg Ser Arg His Trp Ser Cys Ser Ile Val 
55 60 65 

Xaa Cys Leu His Leu His Xaa Leu Leu Ser Val Glu Thr Arg Xaa Phe 
75 80 

Xaa Lys His Leu Leu Val Leu Leu Val Ala Val Ala His Ser Val Leu 
85 90 95 100 

Glu Pro Pro Ala Leu Val Pro Asn Val Gln Cys Glu Met Cys Thr His 
105 110 115 

Ser Gly Pro Arg Asp Leu Glu Ala Ala Val Val Ser Pro Ala Pro Trp 
120 125 130 

Glu 



<210> 458 

<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -28..-1 

<400> 458 

Met Val Leu Thr Leu Gly Glu Ser Trp Pro Val Leu Val Gly Arg Arg 

-25 -20 -15 

Lf^^ Leu Ser Leu Ser Ala Ala Asp Gly Ser Asp Gly Ser His Asp Ser 

hi '^^ 1 

Trp Asp Val Glu Arg Val Ala Glu Trp Pro Trp Leu Ser Gly Thr Ile 

10 15 20 

Arg Ala Val Ser His Thr Asp Val Thr Lys Lys Asp Leu Lys Val Cys 
25 30 35 

Val Glu Phe Xaa Gly Glu Ser Trp Arg Lys Arg Arg Trp Ile Glu Val 
40 45 50 

Tyr Ser Leu Leu Arg Lys Ala Phe Leu Val Lys His Asn Leu Val Leu 

55 60 65 

Ala Glu Arg Lys Ser Pro Glu Ile Ser Trp Gly 
70 75 



<210> 459 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -13 . . -1 

<400> 459 

Met Leu Val Leu Arg Ser Ala Leu Thr Arg Ala Leu Ala Ser Arg Thr 
-10 -5 1 

Leu Ala Pro Gln Met Cys Ser Ser Phe Ala Thr Gly Pro Arg Gln Tyr 
5 10 15 

Asp Gly Ile Phe Tyr Glu Phe Arg Ser Tyr Tyr Leu Lys Pro Ser Lys 
20 25 30 35 

Met Asn Glu Phe Leu Glu Asn Phe Glu Lys Asn Ala Gln Leu Arg Thr 
40 45 50 

Ala His Ser Glu Leu Val Gly Tyr Trp Ser Val Xaa Phe Gly Gly Arg 
55 60 65 

Met Xaa Thr Val Phe His Ile Trp Lys Tyr Asp Asn Phe Ala His Arg 
70 75 80 

Thr Glu Phe Gln Lys Ala Leu Ala Lys Asp Lys Glu Trp Gln Glu Gln 
85 90 95 

Phe Leu Ile Pro Asn Leu Ala Leu Asn 
100 105 













<210> 460 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -17..-1 

<400> 460 

Met Lys Val Gly Val Leu Trp Leu Ile Ser Phe Phe Thr Phe Thr Asp 
-15 -10 -5 

Gly His Gly Gly Phe Leu Gly Val Ser Trp Cys Tyr Val Ser Tyr Leu 

1 5 10 15 

Phe Ser Thr Asn Ser Pro Leu Ser Phe Arg Arg Ile 
20 25 



<210> 461 
<211> 109 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -13 . . -1 

<400> 461 

Met Cys Leu Leu Thr Ala Leu Val Thr Gln Val Ile Ser Leu Arg Lys 

-10 -5 1 

Asn Ala Glu Arg Thr Cys Leu Cys Lys Arg Arg Trp Pro Trp Xaa Pro 

5 10 15 

Ser Pro Arg Ile Tyr Cys Ser Ser Thr Pro Cys Asp Ser Lys Phe Pro 
20 25 30 35 

Thr Val Tyr Ser Ser Ala Pro Phe His Ala Pro Leu Pro Val Gln Asn 

40 45 50 

Ser Leu Trp Gly His Pro Leu His Gly Cys Ser Trp Gln Cys His His 
55 60 65 






Pro Gln Gly Gln Asn Leu Gln Pro Ala Ser Leu Xaa Thr His Leu Ser 
70 75 80 

Lys Pro Lys Arg His Phe Xaa Lys Lys Xaa Cys Gln Ala 
85 90 95 



<210> 462 
<211> 143 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -41 , . -1 

<400> 462 

Met Ala Thr Ala Thr Glu Gln Trp Val Leu Val Glu Met Val Gln Ala 

-40 -35 -30 

Leu Tyr Glu Ala Pro Ala Tyr His Leu Ile Leu Glu Gly Ile Leu Ile 

-25 -20 -15 -10 

Leu Trp Ile Ile Arg Leu Leu Phe Ser Lys Thr Tyr Lys Leu Gln Glu 

Arg Ser Asp Leu Thr Val Lys Glu Lys Glu Glu Leu Ile Glu Glu Trp 
20 

Pro Glu Pro Leu Val Pro Pro Val Pro Lys Asp His Pro Ala Leu 

35 

Asn Tyr Asn Ile Val Ser Gly Pro Pro Ser His Lys Thr Val Val Asn 
45 50 55 

Gly Lys Glu Cys Ile Asn Phe Ala Ser Phe Asn Phe Leu Gly Leu Leu 

60 65 70 

Asp Asn Pro Arg Val Lys Ala Ala Ala Leu Ala Ser Leu Lys Lys Tyr 
75 80 85 

Gly Val Gly Thr Cys Gly Pro Cys Gly Phe Tyr Gly Thr Phe Glu 
90 95 100 



<210> 463 
<211> 232 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -30 . . -1 

<400> 463 

Met Ala Ala Thr Ser Gly Thr Asp Glu Pro Val Ser Gly Glu Leu Val 
-30 -25 -20 -15 

Ser Val Ala His Ala Leu Ser Leu Pro Ala Glu Ser Tyr Gly Asn Xaa 
-10 -5 1 

Xaa Asp Ile Glu Met Ala Trp Ala Met Arg Ala Met Gln His Ala Glu 
5 10 15 

Val Tyr Tyr Lys Leu Ile Ser Ser Val Asp Pro Gln Phe Leu Lys Leu 
20 25 30 

Thr Lys Val Asp Asp Gln Ile Tyr Ser Glu Phe Arg Lys Asn Phe Glu 
35 40 45 50 

Thr Leu Arg Ile Asp Val Leu Xaa Pro Glu Xaa Leu Lys Ser Glu Ser 
55 60 65 

Ala Lys Glu Pro Pro Gly Tyr Asn Ser Leu Pro Leu Lys Leu Leu Gly 
70 75 80 

Thr Gly Lys Ala Ile Thr Lys Leu Phe Ile Ser Val Phe Arg Thr Lys 
85 90 95 

Lys Glu Arg Lys Glu Ser Thr Met Glu Glu Lys Lys Glu Leu Thr Val 
100 105 110 

Glu Lys Lys Arg Thr Pro Arg Met Glu Glu Arg Lys Glu Leu Ile Val 
115 120 125 130 

Glu Lys Lys Lys Arg Lys Glu Ser Thr Glu Lys Thr Lys Leu Thr Lys 
135 140 145 

Glu Glu Lys Lys Gly Lys Lys Leu Thr Lys Lys Ser Thr Lys Val Val 
150 155 160 

Lys Lys Leu Cys Lys Val Tyr Arg Glu Gln His Ser Arg Ser Tyr Asp 
165 170 175 

Ser Ile Glu Thr Thr Ser Thr Thr Val Leu Leu Ala Gln Thr Pro Leu 
180 185 190 

Val Lys Cys Lys Phe Leu Tyr Asn 
195 200 





















<210> 464 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -21..-1 

<400> 464 

Met Thr Phe Arg His Gln Asp Asn Ser Leu Met Phe Phe Ser Met Met 

-15 -10 

Ala Thr Cys Thr Ser Asn Val Gly Phe Thr His Thr Thr Met Asn Cys 
1 5 10 

Ser Leu Thr Ser Pro Val Asp Phe Lys Asp Leu Leu Arg Val Leu Leu 

15 20 25 

Ile Lys Phe Gly Tyr Asp Arg Lys Ser Thr Ile Lys Ser 
30 35 40 



<210> 465 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 465 

Met Phe Leu Lys Ser Gly Ala Gly Leu Ser Ser Cys Leu Leu Pro Leu 
-15 -10 -5 






Cys Trp Leu Glu Arg Lys Asp His Gly Arg Arg Pro Ser Xaa His Pro 
1 5 10 

Gly Arg 
15 



<210> 466 

<211> 215 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -54. ,-1 

<400> 466 



Met 


Asn 


Xaa 


Tyr 


Ala 


Ser 


Pro 


Phe 


Asn 


Xaa Gin 


Leu 


Xaa 


Tyr 


Leu 


Xaa 










-50 










-45 








-40 




Leu 


Ser 


Arg 


Phe 


Glu 


Cys 


Val 


His 


Arg Asp Gly Arg Val 


He 


Thr 


Leu 








-35 










-30 








-25 






,.^er 


Tyr 


Gin 


Glu 


Gin 


Glu 


Leu 


Gin 


Asp 


Phe Leu 


Leu 


Ser 


Gin 


Met 


Ser 


, 




-20 










-15 








-10 








•f'pln His 


Gin 


Val 


His 


Ala 


Val 


Gin 


Gin 


Leu Ala 


Lys 


Val 


Met 


Gly Trp 




-5 










1 






5 










10 


l^=feln 


Val 


Leu 


Ser 


Phe 


Ser 


Asn 


His 


Val 


Gly Leu Gly Pro 


He 


Glu 


Ser 


43caa 








15 










20 








25 




Gly 


Asn 


Ala 


Ser 


Ala 


He 


Thr 


Val 


Ala Pro 


Gin 


Val 


Val 


Thr 


Met 








30 










35 








40 






HpXieu 

•;;s 


Phe 


Gin 


Phe 


Val 


Met 


Asp 


Leu 


Lys 


Val Ala 


Ala 


Arg 


Leu 


Trp 


Phe 






45 










50 








55 








=,_sSer 


Phe 


Leu 


Val 


Thr 


Asn 


Val 


Lys 


Thr 


Phe Gin 


Lys 


Val 


Met 


Phe 


Tyr 


rtys 


60 










65 








70 








He 


Thr 


Asn 


Gly Val 


He 


Phe 


Val 


Gly His 


Ser 


Lys 


Lys 


Phe 


Ser 


:'t75 










80 








85 








90 


^Gly He 


Lys 


Trp 


Lys 


Val 


Xaa 


He 


Leu 


Phe He 


Lys 


Trp 


Xaa 


Cys 


Leu 










95 










100 








105 






Leu 


His 


Leu 


Ala 


Leu 


Val 


Tyr 


Tyr 


Asp Phe 


Phe 


Gin 


Met 


Phe 


Pro 








110 










115 








120 






Lys 


Xaa 


Val 


Ser 


Xaa 


Asn 


Phe 


Asp 


Leu 


Lys Cys 


Leu 


Gin 


He 


Asn 


Tyr 






125 










130 








135 






Lys 


His 


Lys 


Glu 


Glu 


He 


Thr 


Ser 


Lys Arg Val 


Leu 


Phe 


Leu 


Lys 


He 




140 










145 








150 








He 


He 


Arg 


Lys 


Cys 


Phe 


He 



















155 160 



<210> 467 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. . -1 






<400> 467 

Met Val Val His Leu Leu Tyr Ala His Leu Ser Phe Thr Ser Lys Arg 

-15 -10 -5 

Ala Val Val Met Leu Lys Leu Glu Ile Thr Phe 
1 5 10 



<210> 468 
<211> 85 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24, .-1 

<400> 468 

Met Cys Ser His Ala Ser Met Ser Phe His Thr Leu Phe His Leu Leu 
-20 -15 -10 

Phe Leu Pro His Tyr Ile Glu Thr Phe Lys Pro Gln Ser Lys His Cys 

-5 1 5 

Phe Phe Trp Ile Ala Ala Phe Leu Thr Ser Leu Leu Thr Pro Gln Ser 
10 15 20 

Leu Gln Gly Phe His Ser Ser Leu Cys Ala Leu Arg Ser Gln His Phe 
30 35 40 

Pro Ser Thr Cys Asn Cys Phe Cys Tyr Leu Thr Ile Ile Ala Leu Xaa 
45 50 55 

Tyr Trp Asp Asn Leu 

60 



<210> 469 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -16..-1 

<400> 469 

Met Leu Arg Ile Ala Leu Thr Leu Ile Pro Ser Met Leu Ser Arg Ala 

-15 -10 -5 

Ala Gly Trp Cys Trp Tyr Lys Glu Pro Thr Gln Gln Phe Ser Tyr Leu 
1 5 10 15 

Cys Leu Pro Cys Leu Ser Trp Asn Lys Lys Gly Asn Val Leu Gln Leu 
20 25 30 

Pro Asn Phe 
35 



<210> 470 
<211> 67 
<212> PRT 






<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -43 . . -1 



<400> 470 

Met Thr Pro Gln Tyr Leu Pro His Gly Gly Lys Tyr Gln Val Leu Gly 
-40 -35 -30 

Asp Tyr Ser Leu Ala Val Val Phe Pro Leu His Phe Ser Asp Leu Ile 
-25 -20 -15 

Ser Val Leu Tyr Leu Ile Pro Lys Thr Leu Thr Thr Asn Thr Ala Val 
-10 -5 1 5 

Lys His Ser Ile Gln Lys Asn Cys Met Xaa Leu Val Leu Gly Lys Leu 
10 15 20 

Leu Ser Gln 



<210> 471 

<211> 63 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15..-1 

<400> 471 

Met Gly Ile Leu Ser Thr Val Thr Ala Leu Thr Phe Ala Arg Ala Leu 
-15 -10 -5 1 

Asp Gly Cys Arg Asn Gly Ile Ala His Pro Ala Ser Glu Lys His Arg 

5 10 15 

Leu Glu Lys Cys Arg Glu Leu Glu Ser Ser His Ser Ala Pro Gly Ser 
20 25 30 

Thr Gln His Arg Arg Lys Thr Thr Arg Arg Asn Tyr Ser Ser Ala 
35 40 45 



<210> 472 
<211> 179 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -58 . . -1 



<400> 472 

Met Ser Thr Gly Gln Leu Tyr Arg Met Glu Asp Ile Gly Arg Phe His 
-55 -50 -45 

Ser Gln Gln Pro Gly Ser Leu Thr Pro Ser Ser Pro Thr Val Gly Glu 
-40 -35 -30 

Ile Ile Tyr Asn Asn Thr Arg Asn Thr Leu Gly Trp Ile Gly Gly Ile 
-25 -20 -15 

Leu Met Gly Ser Phe Gln Gly Thr Ile Ala Gly Gln Gly Thr Gly Ala 
-10 -5 1 5 


Thr 


Ser 


lie 


Ser 


Glu 


Leu 


Cys 


Lys 


Gly Gin Glu Leu Glu Pro Ser Gly 








10 










15 




20 


Ala 


Gly 


Leu 


Thr 


Val 


TV 1 

Ala 


Pro 


Pro 


Gin 


Ala 


Val Ser Leu Gin Gly He 






25 










30 






35 


Tyr 


Thr 


Leu 


Pro 


Trp 


Leu 


Leu 


Gin 


Leu 


Phe 


His Ser Thr Ala Leu Xaa 




40 










45 








50 




xaa 


Gin 


Gin 


Pro 


Asn 


Gly Ser 


Leu 


Ser 


Leu Asn He Ser Ser Ser 


55 










60 










65 70 


His 


Ala 


Pro 


Xaa 


Pro 


Xaa 


Thr 


Cys 


Thr 


Leu 


Glu Pro Gly Val Asp Pro 










75 










80 


85 


Thr 


Arg 


Xaa 


Val 


Cys 


He 


Asn 


Pro 


His 


Pro 


Pro Pro Pro He Leu Lys 








90 










95 




100 


Xaa 


Pro 


Leu 


Ser 


Pro 


Tyr 


Pro 


Lys 


Pro 


Gin 


Leu Gly Thr His Ala Gly 






105 










110 






115 


Gin 


Val 


Asn 




















120 





















<210> 473 
<211> 238 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -71..-1 

<400> 473 

Met Xaa Xaa Phe Thr Asp Pro Ser Ser Val Asn Glu Lys Lys Arg Arg 
-70 -65 -60 

Glu Arg Glu Glu Arg Gln Asn Ile Val Leu Trp Arg Gln Pro Leu Ile 
-55 -50 -45 -40 

Thr Leu Gln Tyr Phe Ser Leu Glu Ile Leu Val Ile Leu Lys Glu Trp 
-35 -30 -25 

Thr Ser Lys Leu Trp His Arg Gln Ser Ile Val Val Ser Phe Leu Leu 
-20 -15 -10 

Leu Leu Ala Gly Leu Ile Ala Thr Tyr Tyr Val Glu Gly Val His Gln 

-5 1 5 
Gln Tyr Val Gln Arg Ile Glu Lys Gln Phe Leu Leu Tyr Ala Tyr Trp 
10 15 20 25 

Ile Gly Leu Gly Ile Leu Ser Ser Val Gly Leu Gly Thr Gly Leu His 

30 35 40 

Thr Phe Leu Leu Tyr Leu Gly Pro His Ile Ala Ser Val Thr Leu Ala 

45 50 55 

Ala Tyr Glu Cys Asn Ser Val Asn Phe Pro Glu Pro Pro Tyr Pro Asp 

60 65 70 

Gln Ile Ile Cys Pro Asp Glu Glu Gly Thr Glu Gly Thr Ile Ser Leu 

75 80 85 

Trp Ser Ile Ile Ser Lys Val Arg Ile Glu Ala Cys Met Trp Gly Ile 
90 95 100 105 

Gly Thr Ala Ile Gly Glu Leu Pro Pro Tyr Phe Met Ala Arg Ala Ala 

110 115 120 

Arg Leu Ser Gly Ala Glu Pro Asp Asp Glu Glu Tyr Gln Glu Phe Glu 

125 130 135 

Glu Met Leu Glu His Ala Glu Ser Ala Gln Val Arg Thr Val Gly Ile 






140 145 150 

Glu Asn Arg Thr Leu Tyr Phe Phe Leu Lys Arg Leu Leu Arg 
155 160 165 



<210> 474 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -37. .-1 



<400> 474 
























Met Glu 


Arg 


Gin 


Ser Arg Val 


Met Ser 


Glu 


Lys 


Asp 


Glu 


Tyr 


Gin Phe 




-35 










-30 








-25 






Gin His 


Gin 


Gly Ala Val 


Glu 


Leu Leu 


Val 


Phe 


Asn 


Phe 


Leu 


Leu He 


-20 










-15 








-10 








Leu Thr 


He 


Leu 


Thr 


He 


Trp 


Leu Phe 


Lys 


Asn 


His 


Arg 


Phe 


Arg Phe 










1 






5 










10 


''jteu His 


Glu 


Thr 


Gly Gly Ala Met Val 


Tyr 


Gly Leu Xaa Met 


Gly Leu 






15 








20 










25 




^/ile Leu 


Xaa 


Tyr Ala 


Thr 


Ala 


Pro Thr 


Asp 


He 


Glu 


Ser Gly Xaa Val 




30 










35 








40 






iil?yr Asp 


Cys 


Val 


Lys 


Leu 


Thr 


Phe Ser 


Pro 


Ser 


Thr 


Leu 


Leu 


Val Asn 


45 










50 








55 








Ijjle Thr 


Asp 


Gin 


Val 


Tyr 


Glu 


Tyr Lys 


Tyr 


Lys 


Arg 


Glu 


He 


Ser Gin 


yf 0 








65 








70 








75 


His Xaa 


He 


Asn 


Pro 


His 


Xaa 


Gly Asn 


Ala 


He 


Leu 


Glu 


Lys 


Met Thr 








80 








85 










90 


L^phe Asp 


Pro 


Xaa 


He 


Phe 


Phe 


Asn Val 


Leu 


Leu 


Pro 


Pro 


He 


He Phe 






95 








100 










105 




ffeis Ala 


Gly 


Tyr 


Ser 


Leu 


Lys 


Lys Arg 


His 


Phe 


Phe 


Gin 


Asn 


Leu Gly 




110 










115 








120 






^l!feer lie 


Leu 


Thr 


Tyr 


Ala 


Phe 


Leu Gly 


Thr 


Ala 


He 


Ser 


Cys 


He Val 


03 125 










130 








135 








He Gly 



























140 



<210> 475 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21. .-1 

<400> 475 

Met Ser Met Gln Phe Leu Phe Lys Met Val Ala Leu Cys Cys Cys Leu 

-20 -15 -10 

Trp Lys Ile Ser Gly Cys Glu Glu Val Pro Leu Thr Tyr Asn Leu Leu 
-5 1 5 10 






Lys Cys Leu Leu Asp Lys Ala His Cys Val Leu Leu Thr Pro Cys Gly 
15 20 25 

Tyr Ile Phe Ser Leu Ile Ser Pro Glu Ile Leu Lys Leu Thr Leu Ile 
30 35 40 

Thr Leu Xaa Ile Leu Leu Ile Leu Lys Asn Leu His Leu Leu Trp Leu 
45 50 55 

Thr Val Ser Ser Xaa Cys Val His Arg Ser Ser Ala Arg Lys Glu Lys 
60 65 70 75 



<210> 476 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24 . . -1 

<400> 476 

Met His Thr Phe Ala Asn Asp Arg Gly Leu Tyr Arg Ile Leu Leu Leu 
-20 -15 -10 

His Phe Tyr Cys Leu Leu Arg Ser Ser Glu Tyr Ile Leu Gly Tyr Lys 

-5 1 5 

Val Leu Gly Val Phe Phe Pro Ile Leu 

10 15 



<210> 477 
<211> 113 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -27..-1 

<400> 477 

Met Arg Xaa Lys Trp Lys Met Gly Gly Met Lys Tyr Ile Phe Ser Leu 
-25 -20 -15 

Leu Phe Phe Leu Leu Leu Glu Gly Gly Xaa Thr Glu Gln Val Xaa His 
-10 -5 1 5 

Ser Glu Thr Tyr Cys Met Phe Gln Asp Lys Lys Tyr Arg Val Gly Glu 
10 15 20 

Arg Trp His Pro Tyr Leu Glu Pro Tyr Gly Leu Val Tyr Cys Val Asn 
25 30 35 

Cys Ile Cys Ser Glu Asn Gly Asn Val Leu Cys Ser Arg Val Arg Cys 
40 45 50 

Pro Asn Val His Cys Leu Ser Pro Val His Ile Pro His Leu Cys Cys 
55 60 65 

Pro Arg Cys Pro Glu Asp Ser Leu Pro Pro Val Asn Asn Xaa Val Thr 
70 75 80 85 

Ser 






<210> 478 
<211> 250 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 

<400> 478 



Met Arg 


He 


Leu 
-15 


Gin 


Leu 


He 


Leu 


Leu 
-10 


Ala 


Leu Ala 


Thr Gly 
-5 


Leu 


Val 


Gly Gly 


Glu 


Thr 


Arg 


He 


He 


Lys 


Gly Phe Glu Cys 


Lys Pro 


His 


Ser 




1 








5 








10 








Gin Pro 


Trp 


Gin 


Ala 


Ala 


Leu 


Phe 


Glu 


Lys 


Thr Arg Leu Leu 


Cys 


Gly 


15 








20 










25 






30 


Ala Thr 


Leu 


He 


Ala 
35 


Pro 


Arg 


Trp 


Leu 


Leu 
40 


Thr Ala 


Ala His 


Cys 
45 


Leu 


Lys Pro 


Arg 


Tyr 


He 


Xaa 


His 


Leu 


Gly Gin His Asn Leu Gin 


Lys 


Glu 






50 










55 






60 






l^^^piu Gly 


Cys 


Glu 


Gin 


Thr Arg Thr Ala Thr Glu Ser 


Phe Pro 


His 


Pro 


^J^fely Phe 


65 










70 








75 






Asn 


Asn 


Ser 


Leu 


Pro 


Asn 


Lys 


Asp 


Xaa Xaa 


Asn Asp 


He 


Met 












85 








90 








^'teu Val 


Xaa 


Met 


Xaa 


Ser 


Pro 


Val 


Ser 


He 


Thr Trp 


Ala Val 


Arg 


Pro 










100 










105 






110 


-lieu Thr 


Leu 


Ser 


Ser 


Arg 


Cys 


Val 


Thr 


Ala 


Gly Thr 


Ser Cys 


Leu 


He 


IfBer Gly 






115 










120 






125 




Trp 


Gly 
130 


Ser 


Thr 


Ser 


Ser 


Pro 
135 


Gin 


Leu Arg 


Leu Pro 
140 


His 


Thr 


j^-Leu Arg 


Cys 
145 


Ala 


Asn 


He 


Thr 


He 
150 


He 


Glu 


His Gin 


Lys Cys 
155 


Glu 


Asn 


r!kla Tyr 


Pro 


Gly 


Asn 


He 


Thr 


Asp 


Thr 


Met 


Val Cys Ala Ser 


Val 


Gin 












165 








170 








'^blu Gly 


Gly 


Lys 


Asp 


Ser 


Cys Gin Gly Asp Ser Gly Gly Pro 


Leu 


Val 










180 










185 






190 


fjfcys Asn 


Gin 


Ser 


Leu 


Gin Gly He 


He 


Ser 


Trp Gly Gin Asp 


Pro 


Cys 








195 










200 






205 




Ala He 


Thr 


Arg 


Lys 


Pro 


Gly Val 


Tyr 


Thr 


Lys Val 


Cys Lys 


Tyr Val 






210 










215 






220 






Asp Trp 


He 
225 


Gin 


Glu 


Thr 


Met 


Lys 
230 


Asn 


Asn 











<210> 479 

<211> 151 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -21. . -1 

<400> 479 

Met Ala Ala Ser Thr Ser Met Val Pro Val Ala Val Thr Ala Ala Val 
-20 -15 -10 

Ala Pro Val Leu Ser Ile Asn Ser Asp Phe Ser Asp Leu Arg Glu Ile 
-5 1 5 10 

Lys Lys Gln Leu Leu Leu Ile Ala Gly Leu Thr Arg Glu Arg Gly Leu 
15 20 25 

Leu His Ser Ser Lys Trp Ser Ala Glu Leu Ala Phe Ser Leu Pro Ala 
30 35 40 

Leu Pro Leu Ala Glu Leu Gln Pro Pro Pro Pro Ile Thr Glu Glu Asp 
45 50 55 

Ala Gln Asp Met Asp Ala Tyr Thr Leu Ala Lys Ala Tyr Phe Asp Val 
60 65 70 75 

Lys Glu Tyr Asp Arg Ala Ala His Phe Leu His Gly Cys Asn Ala Arg 
80 85 90 

Lys Ala Tyr Phe Leu Tyr Met Tyr Ser Arg Tyr Leu Val Arg Ala Ile 
95 100 105 

Leu Lys Cys His Ser Ala Phe Ser Glu Thr Ser Ile Phe Arg Thr Asn 
110 115 120 

Gly Lys Val Lys Ser Phe Lys 
125 130 



<210> 480 
<211> 239 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25..-1 

<400> 480 



f^et 


Pro 


Arg 


Lys 


Arg 


Lys 


Cys 


Asp 


Leu 


Arg 


Ala 


Val 


Arg Val 


Gly Leu 


;J-25 










-20 










-15 










-10 


Leu 


Leu 


Gly 


Gly 


Gly 
-5 


Gly Val 


Tyr 


Gly 


Ser 
1 


Arg 


Phe 


Arg 


Phe 


Thr 


Phe 


hxo 


Gly 


Cys 


Arg 


Ala 


Leu 


Ser 


Pro 


Trp 


Arg 


Val 


Arg 


Xaa 


5 

Gin 


Arg Arg 






10 










15 










20 








Arg 


Cys 


Glu 


Met 


Ser 


Thr 


Met 


Phe 


Ala 


Asp 


Thr 


Leu 


Leu 


He 


Val 


Phe 




25 










30 










35 










He 


Ser 


Val 


Cys 


Thr 


Ala 


Leu 


Leu 


Ala 


Glu 


Gly 


He 


Thr 


Trp 


Val 


Leu 


40 










45 










50 










55 


Val 


Tyr 


Arg 


Thr 


Asp 


Lys 


Tyr Lys 


Arg 


Leu 


Lys 


Ala 


Glu 


Val 


Glu 


Lys 










60 










65 










70 




Gin 


Ser 


Lys 


Lys 


Leu 


Glu 


Lys 


Lys 


Lys 


Glu 


Thr 


He 


Thr 


Glu 


Ser 


Ala 








75 










80 










85 






Gly 


Arg 


Gin 


Gin 


Lys 


Lys 


Lys 


He 


Glu 


Arg 


Xaa 


Xaa 


Xaa 


Xaa 


Leu 


Xaa 






90 










95 










100 








Asn 


Asn 


Asn 


Arg 


Asp 


Leu 


Ser 


Met 


Val 


Arg 


Met 


Lys 


Ser 


Met 


Phe 


Ala 




105 










110 










115 










He 


Gly 


Phe 


Cys 


Phe 


Thr 


Ala 


Leu 


Met 


Gly 


Met 


Phe 


Asn 


Ser 


He 


Phe 


120 










125 










130 










135 


Asp 


Gly 


Arg 


Val 


Val 


Ala 


Lys 


Leu 


Pro 


Phe 


Thr 


Pro 


Leu 


Ser 


Xaa 


Xaa 










140 










145 










150 




Xaa 


Gly 


Leu 


Ser 


His 


Arg Asn 


Leu 


Leu 


Gly 


Asp 


Asp 


Thr 


Thr 


Asp 


Cys 








155 










160 










165 






Ser 


Phe 


He 


Phe 


Leu 


Xaa 


He 


Leu 


Cys 


Thr 


Met 


Ser 


He 


Arg 


Gin 


Asn 
170 175 180 

Ile Gln Lys Ile Leu Gly Leu Ala Pro Ser Arg Ala Ala Thr Lys Gln 

185 190 195 

Ala Gly Gly Phe Leu Gly Pro Pro Pro Pro Ser Gly Lys Phe Ser 
200 205 210 



<210> 481 
<211> 208 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -92. .-1 



<400> 481 



Met 


Arg 


Glu 


Pro 


Gin 


Lys 


Arg 


Thr 






-90 










-85 


Xaa 


Glu Gly Leu 


Arg 


Asp 


Pro 


Tyr 




-75 










-70 






Arg Arg 


Pro 


Pro 


Glu 


Arg 


Pro 












-55 








Ala 


Ser 


Thr 


Thr 


Pro 


Glu 


Glu 










-40 










Pro 


Ser 


Val 


Pro 


Arg 


Ser 


Asn 








-25 










Leu


Met 


Ser 


lie 


Leu 


Ala 


Leu 


He 






-10 










-5 


Glu 


Ala 


Leu 


Val 


Trp 


Lys 


Val 


Leu 












10 








Xaa 


His 


Ser 


lie 


Phe 


Gly 


Asp 










25 








Phe


Val 


Arg Arg 


Gly 


Tyr 


Leu 


He 








40 












Val 


Glu 


Tyr 


Xaa 


Phe 


Phe 


Trp 






55 










60 


Ser 


Xaa 


Leu 


Lys 


Xaa 


Xaa 


His 


Phe 




70 










75 




Ser 


Lys 


Asp 


Trp 


Pro 


Cys 


Asn 


Tyr 


85 










90 






Glu 


Val 


Glu 


Ala 


He 


Leu 


Asn 


Ser 



105 



Ala 


Thr 


He 


Ala 


Lys 


Xaa 


Xaa 


Ala 










-80 








Gly 


Arg 


Leu 


Cys 


Gly 


Ser 


Glu 


His 








-65 










Glu 


Glu 


Asp 


Pro 


Ser 


Thr 


Pro 


Glu 






-50 










-45 


Ala 


Ser 


Ser 


Thr 


Ala 


Gin 


Ala 


Gin 




-35 










-30 




Phe 


Gin 


Gly 


Thr 


Lys 


Lys 


Ser 


Leu 


-20 










-15 






Phe 


He 


Met 


Gly 


Asn 
1 


Ser 


Ala 


Lys 


Gly 


Lys 


Leu 


Gly 


Met 


Gin 


Pro 


Gly 






15 










20 


Pro 


Lys 


Lys 


He 


Val 


Thr 


Glu 


Xaa 




30 










35 




Tyr 


Xaa 


Pro 


Val 


Pro 


Arg Xaa 


Ser 


45 










50 






Gly 


Pro 


Arg 


Ala 


His 


Val 


Glu 


Ser 










65 








Val 


Ala 


Arg 


Val 


Arg 


Asn Arg 


Cys 








80 










Asp 


Trp 


Asp 


Ser 


Asp 


Asp Asp Ala 






95 










100 


Gly 


Ala 


Xaa 


Gly 


Tyr 


Ser 


Ala 


Pro 




110 










115 





<210> 482 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. .-1 



<400> 482 








Zi cam 


V d X 


O-L y 


Thr Ala 


His 


Xaa 


Xaa 


Val 


Asn Pro Asn Thr Arg Val 










-35 








-30 




-25 


Met 


Asn 


Ser 


Arg 


Gly Ile Trp Leu Ser Tyr Val Leu Ala


Ile Gly Leu








-20








-15 






-10 


Leu 


His 


He 
-5 


Val 


Leu Leu 


Ser 


He 
1 


Pro 


Phe 


Val Ser Val 
5 


Pro Val Val 


Trp 


Thr 


Leu 


Thr 


Asn Leu 


He 


His 


Asn 


Met 


Gly Met Tyr 


He Phe Leu 


10 








15 










20 


25 


His 


Thr 


Val 


Lys 


Gly Thr 
30 


Pro 


Phe 


Glu 


Thr 
35 


Pro Asp Gin 


Gly Lys Ala 
40 


Arg 


Leu 


Leu 


Thr 
45 


His Trp 















<210> 483 
<211> 40 
<212> PRT 

<213> Homo sapiens 



<220> 
<221> SIGNAL
<222> -27..-1

<400> 483

Met Arg Thr Leu Phe Gly Ala Val Arg Ala Pro Phe Ser Ser Leu Thr

-25 -20 -15

Leu Leu Leu Ile Thr Pro Ser Pro Ser Pro Leu Leu Phe Asp Arg Gly

-10 -5 1 5

Leu Ser Leu Arg Ser Ala Met Ser
10



<210> 484

<211> 65

<212> PRT

<213> Homo sapiens

<220> 

<221> SIGNAL 
<222> -16 . . -1 

<400> 484 

Met Leu Gly Phe Phe Leu Phe Leu Ser Phe Val Leu Met Tyr Asp Gly 

-15 -10 -5 

Leu Arg Leu Phe Gly Ile Leu Ser Thr Cys Arg Val His His Thr Met
1 5 10 15

Asn Gln Phe Leu Ile Asp Ile Ser Ser Phe Thr Ser Arg Val Lys Lys

20 25 30

Lys Ile Phe Leu Phe Tyr Ala Phe Xaa Gly Cys Xaa Phe Gln Ser Ala
35 40 45 

Thr 



<210> 485 






<211> 130 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -55. .-1 



<400> 485 

Met Ala Met Trp Asn Arg Pro Xaa Xaa Xaa Leu Pro Gln Gln Pro Leu
-55 -50 -45 -40

Xaa Ala Glu Pro Thr Ala Glu Gly Glu Pro His Leu Pro Thr Gly Arg
-35 -30 -25

Xaa Xaa Thr Glu Ala Asn Arg Phe Ala Tyr Ala Ala Leu Cys Gly Ile
-20 -15 -10

Ser Leu Ser Gln Leu Phe Pro Glu Pro Glu His Ser Ser Phe Cys Thr
-5 1 5

Glu Phe Met Ala Gly Leu Val Xaa Trp Leu Glu Leu Ser Glu Ala Val
10 15 20 25

Leu Pro Thr Met Thr Ala Phe Ala Ser Gly Leu Gly Gly Glu Gly Xaa
30 35 40

Xaa Cys Val Cys Ser Asn Phe Thr Glu Gly Pro His Leu Glu Gly Arg
45 50 55

Pro Asp Gly Asp His Ser Gly Pro Ser Glu Leu Leu Thr Gln Gly Trp
60 65 70

r%l3i Leu 
75



<210> 486 
<211> 209
<212> PRT

<213> Homo sapiens

<221> SIGNAL
<222> -84..-1



<400> 486 

Met Val Asn Phe Pro Gln Lys Ile Ala Gly Glu Leu Tyr Gly Pro Leu
-80 -75 -70

Met Leu Val Phe Thr Leu Val Ala Ile Leu Leu His Gly Met Lys Thr
-65 -60 -55

Ser Asp Thr Ile Ile Arg Glu Gly Thr Leu Met Gly Thr Ala Ile Gly
-50 -45 -40

Thr Cys Phe Gly Tyr Trp Leu Gly Val Ser Ser Phe Ile Tyr Phe Leu
-35 -30 -25

Ala Tyr Leu Cys Asn Ala Gln Ile Thr Met Leu Gln Met Leu Ala Leu
-20 -15 -10 -5

Leu Gly Tyr Gly Leu Phe Gly His Cys Ile Val Leu Phe Ile Thr Tyr
1 5 10

Asn Ile His Leu Arg Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val Gly
15 20 25

Gly Leu Ser Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr Val
30 35 40

Gly Pro Thr Xaa Arg Xaa Leu Leu Cys Gly Thr Leu Ala Ala Leu His
45 50 55 60

Met Leu Phe Leu Leu Tyr Leu His Phe Ala Tyr His Lys Xaa Val Xaa
65 70 75

Gly Ile Leu Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro Ile Gln Arg
80 85 90

Val Pro Arg Asp Ile Pro Ala Met Leu Pro Ala Ala Arg Leu Pro Thr
95 100 105

Thr Val Leu Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gln Ser
110 115 120

His
125



<210> 487 

<211> 36 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -17..-1

<400> 487

Met Gly Trp Gln Arg Trp Trp Cys Phe His Leu Gln Ala Glu Ala Ser

Ala His Pro Pro Gln Gly Leu Gln Ala Gln Phe Ser Cys Cys Pro Trp
1 5 10 15

ip3il Gly He Cys 



<210> 488

<211> 44

<212> PRT

<213> Homo sapiens

<221> SIGNAL 
<222> -29. . -1 

<400> 488 

Met Met Ser Ser Glu Leu Arg Arg Asn Pro His Phe Leu Lys Ser Asn 

-25 -20 -15 

Leu Phe Leu Gln Leu Leu Val Ser His Glu Ile Val Cys Ala Thr Glu

-10 -5 1 

Thr Val Thr Thr Asn Phe Leu Arg His Glu Lys Ala 
5 10 15 



<210> 489 

<211> 163 

<212> PRT 

<213> Homo sapiens 



<220> 



<221> SIGNAL 
<222> -52 . .-1 



<400> 489 



Met Glu


His 


Tyr 




- 50 




Pro Met 


Pro 


Gin 


- 35 






Gly Ser 


Lys 


He 


-20 








Ssr 


Ax a 


Gly Lys 


Ala 


Val 




15 




Leu His 


Gin 


Leu 


30 






Val Pro 


Xaa 


Ser 


45 






His Val 


Pro 


Ala 


Asn Glu 


Cys 


Gly 


'^Ij^et Pro 




80 


Ser 


Ser 


95 




pilrhr Arg 


Ser 




110 







Arg Lys 


Ala 


Gly 






-45 


Leu Pro 


Pro 


Asp 




-30 




Arg Asn 


Leu 


Leu 


-15 






Arg His Val 


Val 


1 






Ser Cys 


Ala 


Glu 






20 


Thr Lys 


Leu 


Xaa 




35 




Pro Asp 


Thr 


Gly 


50 






Xaa Trp 


Val 


Leu 


65 






Tyr Gin 


Pro 


Pro 


Ser Cys 


Gly 


Pro 



100 



Ser Val Glu Leu 

Thr Leu Glu Met 
-25 

Gly Leu Ala Leu 
-10 

Phe Ser Gly Ser 
5 

He Val Lys Arg 

Phe Leu Gin Thr 
40 

Leu Xaa Pro Leu 
55 

Leu Xaa Arg Asp 
70 

Gly Ala Pro Pro 
85 

Arg Ser Xaa Lys 



Pro Ala Pro Ser 
-40 

Arg Val Arg Asp 

Gly Arg Leu Glu 
-5 

Gly Arg Ala Ala 
10 

Arg Val Pro Gly 
25 

Glu Asp Ser Trp 

Thr Val Arg Arg 
60 

Pro Leu Asp Pro 
75 

Gly Leu Gly Ser 
90 

Arg Ala Xaa Xaa 
105 



<210> 490

<211> 64

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -47..-1

<400> 490

Met His Gly Phe Glu Ile Ile Ser Leu Lys Glu Glu Ser Pro Leu Gly

-45 -40 -35

Lys Val Ser Gln Gly Pro Leu Phe Asn Val Thr Ser Gly Ser Ser Ser

-30 -25 -20

Pro Val Thr Trp Leu Gly Leu Leu Ser Phe Gln Asn Leu His Cys Phe
-15 -10 -5 1 

Pro Asp Leu Pro Thr Glu Met Pro Leu Xaa Ala Lys Gly Xaa Asn Thr 
5 10 15 



<210> 491 
<211> 218 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 






<222> -50. .-1 

<400> 491 

Met His His Gly Leu Thr Pro Leu Leu Leu Gly Val His Glu Gln Lys
-50 -45 -40 -35

Gln Gln Val Val Lys Phe Leu Ile Lys Lys Lys Ala Asn Leu Asn Ala
-30 -25 -20

Leu Asp Arg Tyr Gly Arg Thr Ala Leu Ile Leu Ala Val Cys Cys Gly
-15 -10 -5

Ser Ala Ser Ile Val Ser Leu Leu Leu Glu Gln Asn Ile Asp Val Ser
1 5 10

Ser Gln Asp Leu Ser Gly Gln Thr Ala Lys Lys Tyr Ala Val Ser Ser
15 20 25 30

Arg His Asn Val Ile Cys Gln Leu Leu Ser Asp Tyr Lys Xaa Lys Gln
35 40 45

Xaa Leu Lys Val Ser Ser Glu Asn Ser Asn Pro Xaa Gln Asp Leu Lys
50 55 60

Leu Thr Ser Glu Glu Glu Ser Gln Arg Leu Lys Gly Ser Glu Asn Ser
65 70 75

Gln Pro Glu Glu Met Ser Gln Glu Pro Glu Ile Asn Xaa Gly Gly Asp
80 85 90

Arg Lys Val Glu Xaa Xaa Met Lys Lys His Gly Ser Xaa His Met Gly
95 100 105 110

Phe Pro Xaa Asn Leu Xaa Asn Gly Ala Thr Ala Asp Asn Gly Asp Asp
115 120 125

Gly Leu Ile Pro Pro Xaa Lys Xaa Xaa Thr Pro Glu Ser Xaa Gln Phe
130 135 140

Pro Asp Thr Glu Asn Glu Gln Tyr His Arg Asp Phe Ser Gly His Pro
145 150 155

Xaa Phe Pro Thr Thr Leu Pro Ile Lys Gln
160 165



<210> 492
<211> 216
<212> PRT
<213> Homo sapiens

<220> 

<221> SIGNAL 
<222> -15 . . -1 

<400> 492 

Met Val Cys Val Leu Val Leu Ala Ala Ala Ala Gly Ala Val Ala Val 
-15 -10 -5 1 

Phe Leu Ile Leu Arg Ile Trp Val Val Leu Arg Ser Met Asp Val Thr
5 10 15

Pro Arg Glu Ser Leu Ser Ile Leu Val Val Ala Gly Ser Gly Gly His
20 25 30

Thr Thr Glu Ile Leu Arg Leu Leu Gly Ser Leu Ser Asn Ala Tyr Ser
35 40 45

Pro Arg His Tyr Val Ile Ala Asp Thr Asp Glu Met Ser Ala Asn Lys
50 55 60 65

Ile Asn Ser Phe Glu Leu Xaa Arg Xaa Asp Arg Xaa Pro Ser Asn Met
70 75 80

Xaa Thr Lys Tyr Tyr Ile His Arg Ile Pro Xaa Ser Arg Glu Val Gln
85 90 95

Gln Ser Trp Pro Ser Thr Val Xaa Thr Thr Leu His Ser Met Trp Leu
100 105 110

Ser Xaa Pro Leu Ile His Arg Val Lys Pro Xaa Leu Val Leu Cys Asn
115 120 125

Gly Pro Gly Thr Cys Val Pro Ile Cys Val Ser Ala Leu Leu Leu Gly
130 135 140 145

Ile Leu Gly Ile Lys Lys Val Ile Ile Val Tyr Val Glu Ser Ile Cys
150 155 160

Arg Val Lys Thr Leu Ser Met Ser Gly Lys Ile Leu Phe His Leu Ser
165 170 175

Asn Tyr Phe Ile Val Gln Trp Pro Ala Leu Lys Glu Lys Tyr Pro Lys
180 185 190

Ser Val Tyr Leu Gly Arg Ile Val
195 200



<210> 493 
<211> 134 
<212> PRT 

<213> Homo sapiens
<220>

<221> SIGNAL
<222> -19..-1

<400> 493



Met


Pro 


Leu Gly Ala Arg 


He 


Leu 


Phe 


His 


Gly 


Val 


Phe 


Tyr 


Ala Gly 






-15 








-10 










-5 


Gly 


Phe 


Ala He Val Tyr 


Tyr 


Leu 


He 


Gin 


Lys 


Phe 


His 


Ser 


Arg Thr 


it 




1 




5 










10 








Tyr 


Tyr Lys Leu Ala 


Val 


Glu 


Gin 


Leu 


Gin 


Xaa 


His 


Pro 


Glu Ala 


Gin 


15 




20 










25 








Glu 


Ala Leu Gly Pro 


Pro 


Leu 


Asn 


He 


His 


Tyr 


Leu 


Lys 


Leu He 


30




35 










40 








45 




Arg 


Glu Asn Phe Val 


Asp 


He 


Val 


Xaa 


Ala 


Lys 


Leu 


Lys 


He Pro 


'iff 




50 








55 










60 


Val 


Ser 


Gly Ser Lys Ser 


Glu 


Gly 


Leu 


Leu 


Tyr 


Val 


His 


Ser 


Ser Arg 






65 






70 










75 




Gly 


Gly 


Pro Phe Gin Arg 


Trp 


His 


Leu 


Asp 


Glu 


Val 


Phe 


Leu 


Glu Leu 






80 




85 










90 






Lys 


Asp 


Gly Gin Gin He 


Pro 


Val 


Phe 


Lys 


Leu 


Ser 


Gly 


Glu 


Asn Gly 




95 




100 










105 









Asp Glu Val Lys Lys Glu 
110 115 



<210> 494 
<211> 85 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16 . . -1 






<400> 494 



Met 


Ala 


Val 


Thr 


Ala 


Leu 


Ala 


Ala Xaa Thr Trp Leu 


Gly 


Val 


Trp 


Gly 




-15 










-10 


-5 










Val 


Arg 


Thr 


Met 


Gin 


Ala 


Arg 


Gly Phe Gly Ser Asp 


Gin 


Ser 


Glu 


Asn 


1 








5 






10 






15 




Val 


Asp 


Arg 


Gly 


Ala 


Gly 


Ser 


lie Arg Glu Ala Gly 


Gly 


Ala 


Phe 


Gly 








20 








25 




30 






Lys 


Arg 


Glu 


Gin 


Ala 


Glu 


Glu 


Glu Arg Tyr Phe Arg 


Ala 


Gin 


Ser 


Thr 






35 










40 


45 








Glu 


Gin 


Leu 


Ala 


Xaa 


Leu 


Lys 


Lys Xaa His Glu Glu 


Glu 


He 


Val 


His 




50 










55 


60 










His 


Arg 


Glu 


Gly 


Asp 

















<210> 495
<211> 292
<212> PRT

<213> Homo sapiens
<220>

<221> SIGNAL
<222> -29..-1

<400> 495



Met


His 


Gly 


Leu 


Leu 
-25 


His 


Tyr 


Leu 


Phe 


His 
-20 


Thr 


Arg 


Asn 


His 


Thr 
-15 


Phe 


Ile


Val 


Leu 


His 


Leu 


Val 


Leu 


Gin Gly Met Val 


Tyr 


Thr 


Glu 


Tyr 


Thr 








-10 










-5 










1 






.^rp 


Glu 
5 


Val 


Phe 


Gly 


Tyr 


Cys 
10 


Gin 


Glu 


Leu 


Glu 


Leu 
15 


Ser 


Leu 


His 


Tyr 




Leu 


Leu 


Pro 


Tyr 


Leu 


Leu 


Leu Gly Val Asn Leu 


Phe 


Phe 


Phe 


Thr 


.,2 0 










25 










30 










35 


'%en 


Thr 


Cys 


Gly 


Thr 


Asn 


Pro 


Gly 


He 


He 


Thr 


Lys 


Ala 


Asn 


Glu 


Leu 


IJ 

'riLeu 








40 










45 










50 




Phe 


Leu 


His 
55 


Val 


Tyr 


Glu 


Phe 


Asp 
60 


Glu 


Xaa 


Met 


Phe 


Pro 
65 


Lys 


Asn 


Val 


Arg 


Cys 
70 


Ser 


Thr 


Cys 


Asp 


Leu 
75 


Arg 


Lys 


Pro 


Ala 


Arg 
80 


Ser 


Xaa 


His 


Cys 


Xaa 
85 


Val 


Cys 


Asn 


Trp 


Cys 
90 


Val 


His 


Arg 


Phe 


Xaa 
95 


His 


His 


Cys 


Val 


Trp 


Val 


Asn 


Asn 


Cys 


He 


Gly 


Ala 


Trp 


Asn 


He 


Arg 


Xaa 


Phe 


Leu 


He 


100 










105 










110 










115 


Tyr 


Val 


Leu 


Thr 


Leu 
120 


Thr 


Ala 


Ser 


Ala 


Ala 
125 


Thr 


Val 


Ala 


He 


Val 
130 


Ser 


Thr 


Thr 


Phe 


Leu 


Val 


His 


Leu 


Val 


Val 


Met 


Ser Asp 


Leu 


Tyr 


Gin 


Glu 








135 










140 










145 






Thr 


Tyr 


He 
150 


Asp 


Asp 


Leu 


Gly 


His 
155 


Leu 


His 


Val 


Met 


Asp 
160 


Thr 


Val 


Phe 


Leu 


He 
165 


Gin 


Tyr 


Leu 


Phe 


Leu 
170 


Thr 


Phe 


Pro 


Arg 


He 
175 


Val 


Phe 


Met 


Leu 


Gly 


Phe 


Val 


Val 


Val 


Leu 


Xaa 


Phe 


Leu 


Leu Gly Gly 


Tyr 


Leu 


Leu 


Phe 


180 










185 










190 










195 


Val 


Leu 


Tyr 


Leu 


Ala 
200 


Ala 


Thr 


Asn 


Gin 


Thr 
205 


Thr 


Asn 


Glu 


Trp 


Tyr 
210 


Arg 






Xaa Asp Trp Ala Trp Cys Gln Arg Cys Pro Leu Val Ala Trp Pro Pro
215 220 225

Ser Ala Glu Pro Gln Val His Arg Asn Ile His Ser His Gly Leu Arg
230 235 240

Xaa Asn Leu Gln Glu Ile Phe Leu Pro Ala Phe Pro Cys His Glu Arg
245 250 255

Lys Lys Gln Glu
260



<210> 496 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -56. .-1 



<400> 496 



Met


Thr 


Gly 


Phe 


Leu 


Leu 


Pro 


Pro 


Ala 


Ser Arg Gly Thr Arg Arg 


Ser 




-55 










-50 










-45 






I'feys 


Ser 


Arg 


Ser Arg 


Lys 


Arg 


Gin 


Thr 


Arg 


Arg Arg Arg Asn 


Pro 


Ser 


-40










-35 










-30 






-25 


^;^|er 


Phe 


Val 


Ala 


Ser 


Cys 


Pro 


Thr 


Leu 


Leu 


Pro 


Phe Ala Cys 


Val 


Pro 










-20 










-15 






-10 






Ala 


Ser 


Pro 
-5 


Thr 


Thr 


Leu 


Ala 


Phe 


Pro 


Pro 


Val Xaa Leu 


Thr Gly 


Ul»ro 


Xaa 


Thr 


Asp 


Gly 


He 


Pro 


Phe 


1 

Ala 


Leu 


Xaa 


5 

Ser Ala Ala 


Gly 


Pro 




10 










15 










20 






|.f>he 


Cys 


Ala 


Ser 


Phe 


Pro 


Ser Gly Xaa 


Leu 


Ser 


Pro Pro Gly 


Pro 


Leu 


^25 










30 










35 






40 


;,^:Pro 


Gly 


Val 


Arg 


Gly Leu 


Pro 


Leu 


Pro 


Ser 


Val 


Phe Tyr Ser 


Cys 


Gly 










45 










50 






55 




'iJvla 


His 


Pro 


Lys 


Val 


Leu 


Lys 


Val 


Ala 


Leu 
















60 










65 













<210> 497 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28 . . -1 

<400> 497 

Met Leu Xaa Leu Ser Arg Ala Thr Lys Xaa Gly Arg Ala Arg Trp Leu
-25 -20 -15

Met Pro Val Ile Pro Ala Leu Gln Glu Ala Xaa Ala Gly Gly Ser Arg
-10 -5 1

Gly Gln Glu Phe Glu Thr Ser Leu Ala Asn Met Glu Thr Glu Ala Gly
5 10 15 20

Glu Leu Leu Lys Pro Arg Arg Arg Arg Leu Gln
25 30



<210> 498 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -13 . . -1 



<400> 498 

Met His Leu Leu Ser Asn Trp Ala Asn Pro Ala Ser Ser Arg Arg Pro 

-10 -5 1 

Ser Met Ala Ala Ser Gly Thr Ser Trp Ile Ser Ser Thr Leu Ala His

5 10 15

Ser Leu Ser Leu Arg Asp Val Ser Glu Arg Leu Cys Ser Cys Trp Arg
20 25 30 35

Thr Ile Ser Met Gly Pro Cys Ala Arg Gly Ser Pro Met Asn Ser Ser
40 45 50

Gly Val His Arg Lys Ser Ser Arg Leu Phe Tyr Ile Arg Thr Pro Met
55 60 65

Arg Arg Ser Ser Cys His Leu Glu Cys Xaa Val Ile Phe Leu Leu Gly
70 75 80

Arg Gln Leu
85



<210> 499

<211> 99

<212> PRT

<213> Homo sapiens

<220>

<221> SIGNAL
<222> -13. .-1 



<400> 499 
Met His Leu Leu Ser Asn Trp Ala Asn Pro Ala Ser Ser Arg Arg Pro
-10 -5 1

Ser Met Ala Ala Ser Gly Thr Ser Trp Ile Ser Ser Thr Leu Ala His
5 10 15

Ser Leu Ser Leu Arg Asp Val Ser Glu Arg Leu Cys Ser Cys Trp Arg
20 25 30 35

Thr Ile Ser Met Gly Pro Cys Ala Arg Gly Ser Pro Met Asn Ser Ser
40 45 50

Gly Val His Arg Lys Ser Ser Arg Leu Phe Tyr Ile Arg Thr Pro Met
55 60 65

Arg Arg Ser Ser Cys His Leu Xaa Cys Gln Val Ile Phe Leu Leu Gly
70 75 80

Arg Gln Leu
85






<210> 500 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25 . . -1 



<400> 500 

Met Ser Leu Thr Ser Ser Ser Ser Val Arg Val Glu Trp Ile Ala Ala
-25 -20 -15 -10

Val Thr Ile Ala Ala Gly Thr Ala Ala Ile Gly Tyr Leu Ala Tyr Lys
-5 1 5

Arg Phe Tyr Val Lys Asp His Arg Asn Lys Ala Met Ile Asn Leu His
10 15 20

Ile Gln Lys Asp Asn Pro Lys Ile Val His Ala Phe Asp Met Glu Asp
25 30 35

Leu Gly Asp Lys Ala Val Tyr Cys Arg Cys Trp Arg Ser Lys Lys Phe
40 45 50 55

Pro Phe Cys Asp Gly Ala His Thr Lys His Asn Glu Glu Thr Gly Asp
60 65 70

Asn Val Gly Pro Leu Ile Ile Lys Lys Lys Glu Thr
75 80



<210> 501
<211> 183
<212> PRT
<213> Homo sapiens

<220>

<221> SIGNAL
<222> -15..-1

<400> 501

Met Glu Ala Met Trp Leu Leu Cys Val Ala Leu Ala Val Leu Ala Trp 
-15 -10 -5 1 

Gly Phe Leu Trp Val Trp Asp Ser Ser Glu Arg Met Lys Ser Arg Glu 

5 10 15 

Gln Gly Arg Arg Leu Gly Ala Glu Ser Arg Thr Leu Leu Val Ile Ala

20 25 30 

His Pro Asp Asp Glu Ala Met Phe Phe Ala Pro Thr Val Leu Gly Leu 

35 40 45 

Ala Arg Leu Arg His Trp Val Tyr Leu Leu Cys Phe Ser Ala Gly Asn 
50 55 60 65 

Tyr Tyr Asn Gln Gly Glu Thr Arg Lys Lys Glu Leu Leu Gln Ser Cys

70 75 80

Asp Val Leu Gly Ile Pro Leu Ser Ser Val Met Ile Ile Asp Asn Arg

85 90 95

Asp Phe Pro Xaa Asp Pro Gly Met Gln Trp Asp Thr Xaa His Val Ala

100 105 110

Xaa Val Leu Leu Gln His Ile Glu Val Asn Gly Ile Asn Leu Val Val

115 120 125

Thr Phe Asp Ala Gly Gly Xaa Ser Gly His Ser Asn His Ile Ala Leu

130 135 140 145

Tyr Ala Ala Val Arg Lys Leu Glu Gly Gln Ile Cys Lys Pro Cys Gly

150 155 160

Thr Gly Gln Asp Phe Lys Glu
165 



<210> 502 

<211> 98 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -15. .-1 



<400> 502 
















Met 


Glu 


Ala 


Met 


Trp 


Leu Leu 


Cys 


Val 


Ala Leu Ala Val Leu 


Ala Trp 


-15 










-10 






-5 


1 


Gly 


Phe 


Leu 


Trp 


Val 


Trp Asp 


Ser 


Ser 


Glu Arg Met Lys Ser Arg Glu 


Gln






5 








10 


15 




Gly 


Xaa 


Arg 


Leu Gly Ala Glu Ser Arg Thr Leu Leu Val 


Ile Ala






20 








25 




30 






Pro 


Asp 


Asp 


Glu 


Ala Met 


Phe 


Phe 


Ala Pro Thr Val Leu 


Gly Leu 




35 








40 






45 


Ala


Arg 


Leu 


Arg 


His 


Trp Val 


Tyr 


Leu 


Leu Cys Phe Ser Ala 


Val Phe 


50










55 






60 


65 




Arg 


Glu 


Leu 


Ser 


Glu Tyr 


Thr 


Glu 


Xaa Leu Thr Ser Glu 


Pro Leu 










70 








75 


80 



Xaa Ala



<210> 503
<211> 183
<212> PRT
<213> Homo sapiens

<220> 

<221> SIGNAL 
<222> -57 . . -1 



<400> 503 
Met Asp Val Thr Gly Asp Glu Glu Glu Glu Ile Lys Gln Glu Ile Asn
-55 -50 -45

Met Leu Lys Lys Tyr Ser His His Arg Asn Ile Ala Thr Tyr Tyr Gly
-40 -35 -30

Ala Phe Ile Lys Lys Asn Pro Pro Gly Met Asp Asp Gln Leu Trp Leu
-25 -20 -15 -10

Val Met Glu Phe Cys Gly Ala Gly Ser Val Thr Asp Leu Ile Lys Asn
-5 1 5

Thr Lys Gly Asn Thr Leu Lys Glu Glu Trp Ile Ala Tyr Ile Cys Xaa
10 15 20

Glu Ile Leu Arg Gly Leu Xaa His Leu His Gln His Lys Val Ile His
25 30 35

Arg Xaa Ile Lys Gly Gln Asn Val Leu Leu Thr Glu Asn Ala Glu Val
40 45 50 55

Lys Leu Val Asp Phe Gly Xaa Xaa Ala Gln Leu Asp Arg Thr Val Gly
60 65 70

Arg Xaa Asn Thr Phe Ile Gly Thr Pro Tyr Trp Met Ala Pro Xaa Val
75 80 85

Ile Ala Cys Asp Glu Asn Pro Xaa Ala Thr Tyr Asp Phe Lys Xaa Asp
90 95 100

Leu Trp Ser Leu Gly Ile Thr Ala Ile Glu Met Ala Glu Gly Leu Pro
105 110 115

Leu Ser Val Thr Cys Thr Pro
120 125



<210> 504 
<211> 140 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14..-1

<400> 504

Phe Leu Thr Ala Leu Leu Trp Arg Gly Arg Ile Pro Gly Arg Gln

Trp Ile Gly Lys His Arg Arg Pro Arg Phe Val Ser Leu Arg Ala Lys
5 10 15

Gln Asn Met Ile Arg Arg Leu Glu Ile Glu Ala Glu Asn His Tyr Trp
20 25 30

Leu Ser Met Pro Tyr Met Thr Arg Glu Gln Glu Arg Gly His Ala Ala
35 40 45 50

Leu Arg Arg Arg Glu Ala Phe Glu Ala Ile Lys Ala Ala Ala Thr Ser
55 60 65

'%ys Phe Pro Pro His Arg Phe He Ala Asp Gin Leu Asp His Leu Asn 
70 75 80

Xaa His Gln Glu Met Val Leu Ile Leu Ser Arg His Pro Trp Ile Leu
85 90 95

Trp Ile Thr Glu Leu Thr Ile Phe Thr Trp Ser Gly Leu Lys Asn Cys
100 105 110 

Ser Leu Cys Glu Asn Glu Leu Trp Thr Ser Leu Tyr 

115 120 125 



<210> 505 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14. . -1 



<400> 505 

Met Ala Ala Leu Val Thr Val Leu Phe Thr Gly Val Arg Arg Leu His 
-10 -5 1 






Cys Ser Ala Xaa Leu Gly Arg Ala Ala Ser Gly Xaa Tyr Ser Arg Asn 

5 10 15 

Trp Leu Pro Thr Pro Pro Ala Thr Gly Pro Leu Pro Ser Ser Gln Thr

20 25 30

Gly His Met Arg Met Ala Ala Leu Leu Pro Gln
35 40 45 



<210> 506 
<211> 101 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 506 



Met 


Gly 


Pro 


Tyr 


Asn 


Val 


Ala 


Val 


Pro Ser 


Asp Val 


Ser 


His 


Ala Arg 




-35 










-30 






-25 










Tyr 


Phe 


Leu 


Phe 


His 


Arg 


Pro 


Leu Arg 


Leu Leu 


Asn 


Leu 


Leu He 












-15 








-10 






-5 


Jfteu 


He 


Glu 


Gly Ser Val Val 


Phe 


Tyr Gin 


Leu Tyr 


Ser 


Leu 


Leu Arg 










1 








5 






10 




^^^^ 


Glu 


Lys 


Trp 


Asn 


His 


Thr 


Leu 


Ser Met 


Ala Leu 


He 


Leu 


Phe Cys 






15 










20 






25 






'|Asn 


Tyr 


Tyr 


Val 


Leu 


Phe 


Lys 


Leu 


Leu Arg Asp Arg Xaa Xaa Leu Gly 




30 










35 






40 










Ala 


Tyr 


Ser 


Tyr 


Pro 


Leu 


Asn 


Ser Tyr 


Glu Leu 


Lys 


Ala 


Asn Xaa 


.45 










50 








55 






60 




Ala 


Ser 


Xaa 


Gin 



















<210> 507
<211> 341
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -55 . . -1 



<400> 507 

Met Arg Lys Val Val Leu Ile Thr Gly Ala Ser Ser Gly Ile Gly Leu
-55 -50 -45 -40

Ala Leu Cys Lys Arg Leu Leu Ala Glu Asp Asp Glu Leu His Leu Cys
-35 -30 -25

Leu Ala Cys Arg Asn Met Ser Lys Ala Glu Ala Val Cys Ala Ala Leu
-20 -15 -10

Leu Ala Ser His Pro Thr Ala Glu Val Thr Ile Val Gln Val Asp Val
-5 1 5

Ser Asn Leu Gln Ser Phe Phe Arg Ala Ser Lys Glu Leu Lys Gln Arg
10 15 20 25

Phe Gln Arg Leu Asp Cys Ile Tyr Leu Asn Ala Gly Ile Met Pro Asn
30 35 40

Pro Gln Leu Asn Ile Lys Ala Leu Phe Phe Gly Leu Phe Ser Arg Lys

45 50 55

Val Ile His Met Phe Ser Thr Ala Glu Gly Leu Leu Thr Gln Gly Asp

60 65 70

Lys Ile Thr Ala Asp Gly Leu Gln Glu Val Phe Glu Thr Asn Val Phe

75 80 85

Gly His Phe Ile Leu Ile Arg Glu Leu Glu Pro Leu Leu Cys His Ser
90 95 100 105

Asp Asn Pro Ser Gln Leu Ile Trp Thr Ser Ser Arg Ser Ala Arg Lys

110 115 120

Ser Asn Phe Ser Leu Glu Asp Phe Gln His Ser Lys Gly Lys Glu Pro

125 130 135

Tyr Ser Ser Ser Lys Tyr Ala Thr Asp Leu Leu Ser Val Ala Leu Asn

140 145 150

Arg Asn Phe Asn Gln Gln Gly Leu Tyr Ser Asn Val Ala Cys Pro Gly

155 160 165

Thr Ala Leu Thr Asn Leu Thr Tyr Gly Ile Leu Pro Pro Phe Ile Trp
170 175 180 185

Thr Leu Leu Met Pro Ala Ile Leu Leu Leu Arg Phe Phe Ala Asn Ala

190 195 200

Phe Thr Leu Thr Pro Tyr Asn Gly Thr Glu Ala Leu Val Trp Leu Phe
205 210 215

His Gln Lys Pro Glu Ser Leu Asn Pro Leu Ile Lys Tyr Leu Ser Ala
220 225 230

Thr Thr Gly Phe Gly Arg Asn Tyr Ile Met Thr Gln Lys Met Asp Leu

235 240 245

Asp Glu Asp Thr Ala Glu Lys Phe Tyr Gln Lys Leu Leu Glu Leu Glu
250 255 260 265

Lys His Ile Arg Val Thr Ile Gln Lys Thr Asp Asn Gln Ala Arg Leu

270 275 280

Ser Gly Ser Cys Leu
285



<210> 508
<211> 108
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -42. .-1 



<400> 508 

Met His Ile Leu Gln Leu Leu Thr Thr Val Asp Asp Gly Ile Gln Ala
-40 -35 -30

Ile Val His Cys Pro Asp Thr Gly Lys Asp Ile Trp Asn Leu Leu Phe
-25 -20 -15

Asp Leu Val Cys His Glu Phe Cys Gln Ser Asp Asp Pro Ala Ile Ile
-10 -5 1 5

Leu Gln Xaa Gln Lys Thr Val Leu Ala Ser Val Phe Ser Val Leu Ser
10 15 20

Ala Ile Tyr Ala Ser Gln Thr Glu Gln Xaa Tyr Leu Lys Ile Xaa Lys
25 30 35

Gly Asp Gly Gly Ser Gly Ser Lys Gly Arg Pro Xaa Xaa Gln Thr Glu
40 45 50

Xaa Phe Leu Cys Ile Ser Lys Pro Ser Ser Phe Leu
55 60 65



<210> 509 

<211> 80 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -26. .-1 

<400> 509 

Met Glu Glu Ile Ser Ser Pro Leu

-25 -20
Thr Asn Gln Val Leu Ile Thr Ala
-10 -5 
Ser Val Arg Cys Val Glu Lys Arg 

'IJjys His Leu Ser Arg Cys He Asp 
It:' 25 30 

f^Phe Thr Phe Cys Leu Glu Phe Ser 
40 45 



Val 


Glu 


Phe 


Val 
-15 


Lys 


Val 


Leu 


Cys 


Arg 


Ala 


Val 
1 


Pro 


Thr 


Lys 


Lys 
5 


Ala 


Phe 


Trp 


He 


Pro 


Lys 


Thr 


Thr 


Ser 


15 










20 






Gly He 


Ser Gly Phe 


Leu 


Asn Asp 










35 








Arg 


His 


Arg 


Cys 


Gin 


Leu 


Thr 


Glu 



<210> 510
<211> 158
<212> PRT
<213> Homo sapiens

<220>

<221> SIGNAL
<222> -44..-1



<400> 510 



Met 


Ala 


Gly 


Phe 


Leu Asp Asn Phe Arg Trp 


Pro 


Glu Cys 


Glu 


Cys 


He 










-40 -35 








-30 




Asp 


Trp 


Ser 


Glu 


Arg Arg Asn Ala Val Ala 


Ser 


Val Val 


Ala 


Gly He 








-25 


-20 






-15 






Leu 


Phe 


Phe 


Thr 


Gly Trp Trp He Met He 


Asp 


Ala Ala 


Val 


Val 


Tyr 






-10 




-5 




1 






Pro 


Lys 


Pro 


Glu 


Gin Leu Asn His Ala Phe 


His 


Thr Cys 


Gly 


Val 


Phe 


5 








10 


15 








20 


Ser 


Thr 


Leu 


Ala 


Phe Phe Met He Asn Ala 


Val 


Ser Asn 


Ala 


Gin 


Val 










25 30 








35 




Arg 


Gly 


Asp 


Ser 


Tyr Glu Ser Gly Cys Leu Gly Arg Thr 


Gly 


Ala 


Arg 








40 


45 






50 






Val 


Trp 


Leu 


Phe 


He Gly Phe Met Leu Met 


Phe 


Gly Ser 


Leu 


He 


Ala 






55 




60 




65 








Ser 


Met 


Trp 


He 


Leu Phe Gly Ala Tyr Val 


Thr 


Gin Asn 


Thr 


Asp 


Val 




70 






75 




80 






Tyr 


Pro 


Gly 


Leu 


Ala Val Phe Phe Gin Asn 


Ala 


Leu He 


Phe 


Phe 


Ser 



90 95 100 
Thr Leu Ile Tyr Lys Phe Gly Arg Thr Glu Glu Leu Trp Thr
105 110 



<210> 511 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28 . . -1 

<400> 511 

Met Asn Trp Glu Leu Leu Leu Trp Leu Leu Val Leu Cys Ala Leu Leu 

-25 -20 -15 

Leu Leu Leu Val Gln Leu Leu Arg Phe Leu Arg Ala Asp Gly Asp Leu

-10 -5 1

Thr Leu Leu Trp Ala Glu Trp Gln Gly Arg Arg Pro Glu Trp Glu Leu
5 10 15 20

Thr Asp Met Val Val Trp Val Thr Gly Ala Ser Ser Gly Ile Gly Glu
25 30 35

Glu Leu Ala Tyr Gln Leu Ser Lys Leu Gly Val Ser Leu Val Leu Ser

40 45 50

Ala Arg Arg Val His Glu Leu Glu Arg Val Lys Arg Arg Cys Leu Glu
55 60 65

Asn Gly Asn Leu Lys Glu Lys Asp Ile Leu Val Leu Pro Leu Asp Leu
70 75 80

Thr Asp Thr Gly Ser His Glu Ser Gly Tyr Gln Ser Cys Ser Pro Gly
85 90 95 100

Ile Trp



<210> 512

<211> 199

<212> PRT

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -62. .-1 

<400> 512 

Met Ser Gln Arg Ser Leu Cys Met Asp Thr Ser Leu Asp Val Tyr Arg
-60 -55 -50

Xaa Leu Ile Glu Leu Asn Tyr Leu Gly Thr Val Ser Leu Thr Lys Cys
-45 -40 -35

Val Leu Pro His Met Ile Glu Arg Lys Gln Gly Lys Ile Val Thr Val
-30 -25 -20 -15

Asn Ser Ile Leu Gly Ile Ile Ser Val Pro Leu Ser Ile Gly Tyr Cys
-10 -5 1

Ala Ser Lys His Ala Leu Arg Gly Phe Phe Asn Gly Leu Arg Thr Glu
5 10 15

Leu Ala Thr Tyr Pro Gly Ile Ile Val Ser Asn Ile Cys Pro Gly Pro
20 25 30

Val Gln Ser Asn Ile Val Glu Asn Ser Leu Ala Gly Glu Val Thr Lys
35 40 45 50

Thr Ile Gly Asn Asn Gly Asn Gln Ser His Lys Met Thr Thr Ser Arg
55 60 65

Cys Val Arg Leu Met Leu Ile Ser Met Ala Asn Asp Leu Lys Glu Val
70 75 80

Trp Ile Ser Glu Gln Pro Phe Leu Leu Val Thr Tyr Leu Trp Gln Tyr
85 90 95

Met Pro Thr Trp Ala Trp Trp Ile Thr Asn Lys Met Gly Lys Lys Arg
100 105 110

Ile Glu Asn Phe Lys Ser Gly Val Asp Ala Xaa Ser Ser Tyr Phe Lys
115 120 125 130

Ile Phe Lys Thr Lys His Asp
135



<210> 513 
<211> 180 
<212> PRT 
<213> Homo sapiens 

<220>

<221> SIGNAL
<222> -25..-1

<400> 513

Met Asn Thr Val Leu Ser Arg Ala Asn Ser Leu Phe Ala Phe Ser Leu
-25 -20 -15 -10

Ser Val Met Ala Ala Leu Thr Phe Gly Cys Phe Ile Xaa Thr Ala Phe
-5 1 5

Lys Asp Arg Ser Val Pro Val Arg Leu His Val Ser Arg Ile Met Leu
10 15 20

Lys Asn Val Glu Asp Phe Thr Gly Pro Arg Glu Arg Ser Asp Leu Gly

25 30 35

Phe Ile Thr Phe Asp Ile Thr Ala Asp Leu Glu Asn Ile Phe Asp Trp
40 45 50 55

Asn Val Lys Gln Leu Phe Leu Tyr Leu Ser Ala Glu Tyr Ser Thr Lys
60 65 70

Asn Asn Ala Leu Asn Gln Xaa Val Leu Trp Asp Lys Ile Val Leu Arg

75 80 85

Gly Asp Asn Pro Lys Leu Leu Leu Lys Asp Met Lys Thr Lys Tyr Phe

90 95 100

Phe Phe Asp Asp Gly Asn Gly Leu Xaa Gly Asn Arg Asn Val Thr Leu

105 110 115

Thr Leu Ser Trp Asn Val Val Pro Asn Ala Gly Ile Leu Pro Leu Val
120 125 130 135

Thr Gly Ser Gly His Val Ser Val Pro Phe Pro Asp Thr Tyr Glu Ile
140 145 150 

Thr Lys Ser Tyr 
155 



<210> 514 
<211> 120 
<212> PRT 
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<213> Bos taurus 
<400> 514 

Met Met Thr Gly Arg Gln Gly Arg Ala Thr Phe Gln Phe Leu Pro Asp 
1 5 10 15 

Glu Ala Arg Ser Leu Pro Pro Pro Lys Leu Thr Asp Pro Arg Leu Ala 
20 25 30 

Phe Val Gly Phe Leu Gly Tyr Cys Ser Gly Leu Ile Asp Asn Ala Ile 
35 40 45 

Arg Arg Arg Pro Val Leu Leu Ala Gly Leu His Arg Gln Leu Leu Tyr 
50 55 60 

Ile Thr Ser Phe Val Phe Val Gly Tyr Tyr Leu Leu Lys Arg Gln Asp 
65 70 75 80 

Tyr Met Tyr Ala Val Arg Asp His Asp Met Phe Ser Tyr Ile Lys Ser 
85 90 95 

His Pro Glu Asp Phe Pro Glu Lys Asp Lys Lys Thr Tyr Gly Glu Val 
100 105 110 

Phe Glu Glu Phe His Pro Val Arg 
115 120 



<210> 515 
<211> 1082 
<212> DNA 
<213> Homo sapiens 

<400> 515 

ypatcccagac ctcggcttgc agtagtgtta 
IP'taacaggatc tcctcttgca gtctgcagcc 
J cgcagcccga agattcacta tggtgaaaat 
[.Sgaggaggcg cggcaagacg tggaggccct 
j!;gaccggcaag gagctccgag ttgccaccca 
J=%cttactctc ttaggccttt cattcatctt 
^^tacaagtac ttcatgccca agagcaccat 
'stgaggatcct gcaaattccc ttcgtggagg 
^ij^gctgacatt cgtgaggatg acaacattgc 
i|;ij|bgatagtgac cctgcagcaa ttattcatga 
cttgttgctg gggaactgct atctgatgcc 
aaatctggta gagctctttg gcaaactggc 
ggttcgagaa gacctagttg ctgtggagga 
tatttaccaa ctttgcaata acagaaagtc 
gggtttcaac aaacgtgcca ttgataaatg 
tattgttgag accaagatct gtcaagagta 
aagaagtcag agatttacaa tatgacttta 
ttactcatgc atttactcta ttgcttatgc 
aa 



gactgaagat aaagtaagtg ctgtttgggc 60 

caggacgctg attccagcag cgccttaccg 120 

cgccttcaat acccctaccg ccgtgcaaaa 180 

cctgagccgc acggtcagaa ctcagatact 240 

ggaaaaagag ggctcctctg ggagatgtat 300 

ggcaggactt attgttggtg gagcctgcat 360 

ttaccgtgga gagatgtgct tttttgattc 420 

agagcctaac ttcctgcctg tgactgagga 480 

aatcattgat gtgcctgtcc ccagtttctc 540 

ctttgaaaag ggaatgactg cttacctgga 600 

cctcaatact tctattgtta tgcctccaaa 660 

gagtggcaga tatctgcctc aaacttatgt 720 

aattcgtgat gttagtaacc ttggcatctt 780 

cttccgcctt cgtcgcagag acctcttgct 840 

ctggaagatt agacacttcc ccaacgaatt 900 

agaggcaaca gatagagtgt ccttggtaat 960 

acattaaggt ttatgggata ctcaagatat 1020 

cgtaaaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

1082 



<210> 516 
<211> 559 
<212> DNA 
<213> Homo sapiens 



<400> 516 

ctgctccagc gctgacgccg agccatggcg gacgaggagc ttgaggcgct gaggagacag 60 
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aggctggccg agctgcaggc caaacacggg gatcctggtg atgcggccca acaggaagca 12 0 

aagcacaggg aagcagaaat gagaaacagt atcttagccc aagttctgga tcagtcggcc 180 

cgggccaggt taagtaactt agcacttgta aagcctgaaa aaactaaagc agtagagaat 240 

taccttatac agatggcaag atatggacaa ctaagtgaga aggtatcaga acaaggttta 300 

atagaaatcc ttaaaaaagt aagccaacaa acagaaaaga caacaacagt gaaattcaac 360 

agaagaaaag taatggactc tgatgaagat gacgattatt gaactacaag tgctcacaga 420 

ctagaactta acggaacaag tctaggacag aagttaagat ctgattattt actttgttta 480 

ttgtctatat gccttttaaa aaaataaact tgttatgcaa aaaaaaaaaa aaaaaaaaaa 540 

aaaaaaaaaa aaaaaaaaa 559 



<210> 517 
<211> 110 
<212> PRT 

<213> Homo sapiens 
<400> 517 



Met Phe Cys Pro Leu Lys Leu Ile Leu Leu Pro Val Leu Leu Asp Tyr 
1 5 10 15 

Ser Leu Gly Leu Asn Asp Leu Asn Val Ser Pro Pro Glu Leu Thr Val 
20 25 30 

His Val Gly Asp Ser Ala Leu Met Gly Cys Val Phe Gln Ser Thr Glu 
35 40 45 

Asp Lys Cys Ile Phe Lys Ile Asp Trp Thr Leu Ser Pro Gly Glu His 
50 55 60 

Ala Lys Asp Glu Tyr Val Leu Tyr Tyr Tyr Ser Asn Leu Ser Val Pro 
65 70 75 80 

Ile Gly Arg Phe Gln Asn Arg Val His Leu Met Gly Asp Asn Leu Cys 
85 90 95 

[illegible] Asp Gly Ser Leu Leu Leu Gln Asp Val Gln Asp Val Glu 
100 105 110 







<210> 518 
<211> 4544 
<212> DNA 
<213> Homo sapiens 

<400> 518 

ccgagaaggg cttcaggacg cgggaggcgc acttgcttca agtcgcgggc gtgggaacgg 60 

ggttgcaaaa cggggccttt ttatccgggc ttgcttccgg cgtcatggct caaagggcct 120 

tcccgaatcc ttatgctgat tataacaaat ccctggccga aggctacttt gatgctgccg 180 

ggaggctgac tcctgagttc tcacaacgct tgaccaataa gattcgggag cttcttcagc 240 

aaatggagag aggcctgaaa tcagcagacc ctcgggatgg caccggttac actggctggg 300 

caggtattgc tgtgctttac ttacatcttt atgatgtatt tggggaccct gcctacctac 360 

agttagcaca tggctatgta aagcaaagtc tgaactgctt aaccaagcgc tccatcacct 420 

tcctttgtgg ggatgcaggc cccctggcag tggccgctgt gctatatcac aagatgaaca 480 

atgagaagca ggcagaagat tgcatcacac ggctaattca cctaaataag attgatcctc 540 

atgctccaaa tgaaatgctc tatgggcgaa taggctacat ctatgctctt ctttttgtca 600 

ataagaactt tggagtggaa aagattcctc aaagccatat tcagcagatt tgtgaaacaa 660 

ttttaacctc tggagaaaac ctagctagga agagaaactt cacggcaaag tctccactga 720 

tgtatgaatg gtaccaggaa tattatgtag gggctgctca tggcctggct ggaatttatt 780 

actacctgat gcagcccagc cttcaagtga gccaagggaa gttacatagt ttggtcaagc 840 

ccagtgtaga ctacgtctgc cagctgaaat tcccttctgg caattaccct ccatgtatag 900 

gtgataatcg agatctgctt gtccattggt gccatggcgc ccctggggta atctacatgc 960 
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tcatccaggc ctataaggta ttcagagagg aaaagtatct ctgtgatgcc tatcagtgtg 1020 

ctgatgtgat ctggcaatat gggttgctga agaagggata tgggctgtgc cacggttctg 1080 

cagggaatgc ctatgccttc ctgacactct acaacctcac acaggacatg aagtacctgt 1140 

atagggcctg taagtttgct gaatggtgct tagagtatgg agaacatgga tgcagaacac 1200 

cagacacccc tttctctctc tttgaaggaa tggctggaac aatatatttc ctggctgacc 1260 

tgctagtccc cacaaaagcc aggttccctg catttgaact ctgaaaggat agcatgccac 1320 

ctgcaactca ctgcatgacc ctttctgtat attcaaaccc aagctaagtg cttccgttgc 1380 

tttccaagga aacaaagagt caaactgtgg acttgatttt gttagctttt ttcagaattt 1440 

atctttcatt cagttccctt ccattatcat ttacttttac ttagaagtat ccaaggaagt 1500 

cttttaactt taatttccat ttcttcctaa agggagagtg agtgatatgt acagtgtttt 1560 

gagattgtat acatatattc cagaacttgg aggaaatctt atttaagttt atgaatataa 1620 

ccatctgtta ctgttctaaa aatgtttaaa agaaactcaa tacagataaa gataaatatg 1680 

tgactattat tgggtattac acttcacttc tctttaatat ttttcctcca actggagggc 1740 

agacaatttt ctgacttgct tttctctagg tggttcattt tgaaagggga cagaaatata 1800 

actaaatgct tccaggagaa aaattccaag agttacaatc tggacttggt acctaaatat 1860 

cattttttaa attcttgatg cctatttgga ctagaggtaa acatactttc agattggcct 1920 

gtttttgtcg gtaaggcata cagccttcag aagccaacat ttttaatcaa aaacttataa 1980 

aacatgatga tcattgtgaa aattctgagt tgaaggttag tttaagataa gctaacaata 2040 

acagtctgtg ttttctctaa aataatctga gttttttgga actctttatt taaatatgtg 2100 

tgtttttcag tattcaaata agatcaggaa gccaattttc tatgtatgaa tatgctttaa 2160 

cctaggattt cagtccactc tgactgactt tctaaacttt aacttgggtt tttacagtga 2220 

..ptatgcatta gtgctgactc tttggtataa gccataaaat attttccttc ctatcaattt 228 0 

^Jl^tctgaactt tggtcttttc actaaattgt acagtattct acttctgttt aaaaagggga 2340 

,;5*^atgagaaag ggaatactat ctaaccaata acttgaacaa aaacactaaa ctaagcattt 2400 

■^4atagaaatg ctttttattg aggaggtatt atccagagtt catgcttaga acaaatgcat 2460 

f'ftttgcgtat cctagactta acaattcatc agtttctgag accacagaat caggttttcc 2520 

■^j^tagtagata aagactctct ggtgcttcaa attctgttca agtgttttga ctcatcagct 2580 

Jlctactcttt ctattactgc ctttgcctgg cttgttttgt ctctttgcaa ctgattttgc 2640 

j^kaaaaaaaat tgtagcttta aaataacagg gtctaagtat tttaaatgtg cctatttcac 2 700 

fpgctctcttg gtcacaaaaa catgctattt ttattggaac ttcaaaccaa atccccactg 2760 

agtgtgtact ggttcctgca ggtagcagtc tcctattatc tcctgtttag caccaaaaga 2820 

gctaatatta ttggaaactg accttttaaa ggccactggc agtaggattt aaaaagcagc 2880 

^pcactgctca gtttccagga tcagcttcct ccttctgtca cttgtgtaag ttggcactac 2940 

J^ttgtgcctc tcagattgct gaagtgctgc tggtaagcat gtgcatgctc tgcctttctt 3000 

gtgaaagttt tcaatcagcg atatcagcac ttacagtaag aagtaaaagt agtgcacagc 3060 

'^4aagctaatt tgcctttgcc tggggtgttc agcttgaaag aataaagctc atttggttta 3120 

l^ttaaatgtc ttactctact gtgcctatgc ttttagctgc gttactaagc aagggaaaaa 3180 

taacagtttc tctgagccag agaagacttg atcacagttc tccaagcatc gtgatagcaa 3240 

tgcttaaccc caggaagatt tcaaggcagg gagaagaaca tttcaaataa gattcttgtt 3300 

aacccattta tgcctagtgt tccattattg gaatgctaag cttgtgggag tcatttacat 3360 

cctactgctc aaagtcattg ccaaggtctg atttttcaca caaaaaattg caacccccag 3420 

cataaatggg ttagctactg tcatcagtta gcaaattcat ccacacaaac acaattagag 3480 

tttggttttt ttttaagctt ttcaaaactt actaaactgg cacaatttta tatgtatgct 3540 

atttgttgta tttatgctta agagcaaaaa agttttgatg ggattttaaa ttcagcaaag 3600 

cctacaacgc tgagacaatc ccctaacaac atggtagtaa ctaaagaaac ttttatacta 3660 

ggcttcttag ttttaaaagg aagtggcatc attgtttcag ttctagtttg tatttttctc 3720 

tcagatattt ttcttcttta aaaatctttc ccagaagttg gttcctagaa aactcaatac 3780 

catcatctct tatctctata cagggactag gtaataaaac cttcaaaggt tgtcaaaggt 3840 

catcaagcag tgttcattta tcctgtcaca tgtttctgtt tctatagtaa tttagaaatt 3900 

gcaaatagtt aacttttcat catgtaaaaa gttaacatta tcctatttcc atagatacca 3960 

tggacggcgg tgtggcctga gttgtcagtc tttaatcctg agtcatgtgg ctctcttttc 4020 

atctttgatg tcagttccaa ttatttggca tcaaaaacct tcatggtagg tagagtttta 4080 

ggtaaaagtg gatctagggt tactttcttt attaacattt cctaaataac tgaattgaga 4140 

gacatactct gctactatgt cctcaggtta atttttgtct gatcttacga tgccctgcct 4200 

tttactagct actttagaaa tagaaaatgt gaagagtgac tatttacatg tatactcctt 4260 

tggctgctag aactcatctg tagtccttta ttatttacac tgaattccaa tttcatttct 4320 

cttccgctaa gtaagagcac ctcattcctg tgttttctct actattgagc tgtagacgaa 4380 
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ctgtttctct aattataaag caaactgttt gggatattca gggaaactac cccaatgtta 4440 
tgttgtcatt taatgggaaa ggctgggatc atatgtattt ctatgttctg taaagtattt 4500 
gacttactag ttctcaataa aattttatta ggactataaa aaaa 4544 



<210> 519 

<211> 1779 

<212> DNA 

<213> Mus musculus 

<400> 519 

ggtccggaat tcccgggtcg acccacgcgt 
tcccggggct gcctctttaa gggagggggt 
gaaatcttga ggccagagcc ccgcacctcg 
<^agggcagaa ccaagagcag tttctgctcc 
ccacactcat ccaccaggtg ctggaggccc 
atatgcctaa tgttagagag ctggcagaaa 
cagtgtttgc ctatgggacc tatgcggact 
tgactgacgc acagaagaat aagcttcgac 
tcaagtgtat cccatatgca gtgttgctgg 
;;u=r,tggaagacct tgtgatcgag gctgtgtatg 
'^^^caatcagcg gctagaggtt gattacagca 
JiH^tgccatcgc ccagaccctg caagagtggt 
}'A:cgaagagca ggtcagccgt gccaaccagc 
^^gatcgaaag tgaggttgcc aaccttaaga 
U'btgcagccac ctcccaggat cctgagcaac 
:;|^caccaacca gcgccagccc agcaagaaag 
|:^|ccaagatttg gtccaagtcg aactgaaagg 
[ffcagctgccta cctgcctacc ccttaggagt 
J "tgataatgct agttcattac ttttcatctc 
tj^gtagggagga ggccagtgca ggtcatgttc 
.tccccagcac gcttgctctc atcgtttttc 
pjtgagcaggtt ggcagtccta tggagggtgc 
5"''atgcctaatc tgcagttcct cctcctcccc 
^^fegggaagggg gtctccacct tcattccagc 
yfeggagtcgct ggacattttc ctcccagaac 
pictcatctcct tgggcctggg gatactgctg 
atttttgaga tgttaagaaa tggttctttg 
atcctggaag gatttatatc tcctcctgtg 
tattaaaaat aaaaaatata tatgaatagg 
aaatgtatga gaaatgtatg tacaaaaaaa 



ccgctggcct tgggcgcaga ccccggccgg 60 

ggagccgcga gtcaggcgcg aggagctcca 120 

gcgcagccat gagtgcggag gtgaaggtga 180 

ttgccaagtc ggctaagggg gcggcactgg 240 

ctggtgtcta cgtgtttggg gaactgctgg 300 

gcgactttgc ctccaccttc cggctgctca 360 

acttagctga agccaggaat ctccccccac 420 

atctgtcagt tgtcactctg gctgccaaag 480 

aggcccttgc ccttcgaaac gtgcgccagc 540 

ctgatgtcct tcgtggctct ctggaccagc 600 

tcgggcggga catccagcgc caggacctca 660 

gcgtgggctg tgaggttgtg ttgtcgggca 720 

acaaggagca gcagctgggc ctgaagcagc 780 

aaaccattaa agttacgaca gcagctgctg 840 

acctgacaga gctgagagaa ccagcttctg 900 

cctccaaggg caagggactc cgagggagcg 960 

acttgtttct tccctgggaa tgtggggtcc 1020 

cctcagagcc ttcctgtgcc cctggccagc 1080 

ctccaccccc aagcataagc cacaccctct 1140 

tgttggtacc tcttatgtgt tccatgctct 1200 

cgcactgtgt ctgcccatta cccctgtcat 1260 

tggctcttaa ccacccacac ctacccctgc 1320 

ttgcctagtg ggctgcatct gaaaagccat 1380 

cttagagttc tggagccagt ctgctaccct 1440 

cccatcacac tacaattgtt tctttcctct 1500 

cttcagtgac cccagagcct gagaacagct 1560 

ttgctcatca tcttaggaag cccaatggaa 1620 

gttctggtgg ggaaggaaat atagattgta 1680 

tctatatata ttgacacatg acacagaaat 1740 

aaaaaaaaa 1779 
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EXTENDED cDNAs 

Abstract of the Disclosure 
The sequences of extended cDNAs encoding secreted proteins are disclosed. The 
extended cDNAs can be used to express secreted proteins or portions thereof or to obtain 
antibodies capable of specifically binding to the secreted proteins. The extended cDNAs 
may also be used in diagnostic, forensic, gene therapy, and chromosome mapping 
procedures. The extended cDNAs may also be used to design expression vectors and 
secretion vectors. 
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BamH! 2772 Clal 2666 



Plasmid name: pED6dpc2 
Plasmid size: 5374 bp 

Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
polylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and NotI. 
pED vectors are described in Kaufman et al. (1991), NAR 19: 4485-4490. 
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TG7TC5AGTTG 
CCCAACTQAC 
AATAGAATTAG 
AACTAAATTAG 
QCAGACCTCAG 
AGATAAATCCA 
CTTCAGTTG 
. TTQTAGATAGGACA 
AGATAGGACAT 
CATAACAQATQGTAAQ 
CATAACAQATGQTAAG 
CATAACAOATGGTAAG 
AOCATCTGTT 
TCAAGlATAAAQTA 
AGrrGGGAATTCC 
AGTTGGGAATTC 
TGGQAATrca 
TCAGTGATATGGCA 
TAAAACAAAACA 
TTTAGGGC 
TGAGGQGA 



GGACCAATCAT 

CCTGQGGA 

TQACOGTTG 

TCCAACGGT 

TTCCTGGAA 

TTCCAGGAA 

TTGGQQQA 

QAATQGGArrrc 

AGAGGGQA 

GAAAACAAAAGA 

GAAGGGGA 

AQCATCTGCC 

TCCCACCTTCC 

<3AGGCAATTAT 

AGAGGGGA 



GGACrCACaTGCTGC 

ACTCACGTGCTG 

ACTGAOGTGCTG 

OAQCAOQTQAQT 

OAQCAOQTQAQT 

CAQGAOQTQAQT 

TCACaTQC 

GOAOaTQA 

CATGGGGA 

CTCTCOGQAAGCCT 

TCCGQAAGCC 

AGTGACTQAAC 

AaTGAOTGAAC 

TGTGGTCTC 
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100.0% identity in 125 aa overlap 

10 20 30 40 50 60 

SEQ ID NO: 217 MADEELEALRRQRLAELQAKHGDPGDAAQQEAKHREAEMRNSILAQVLDQSARARLSNLA 
               :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 
SEQ ID NO: 516 MADEELEALRRQRLAELQAKHGDPGDAAQQEAKHREAEMRNSILAQVLDQSARARLSNLA 

10 20 30 40 50 60 

70 80 90 100 110 120 

SEQ ID NO: 217 LVKPEKTKAVENYLIQMARYGQLSEKVSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSD 
               :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 
SEQ ID NO: 516 LVKPEKTKAVENYLIQMARYGQLSEKVSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSD 

70 80 90 100 110 120 



SEQ ID NO: 217 EDDDY 
               ::::: 

SEQ ID NO: 516 EDDDY 
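
The "% identity in N aa overlap" figure that heads each alignment can be illustrated with a minimal sketch. It assumes the two overlapping segments are already aligned, equal in length, and free of gaps; the alignment program that produced these figures also handles gaps and local overlaps, which this sketch does not attempt. The function name percent_identity is hypothetical and is not taken from the application.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions holding the same residue (ungapped, equal length)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length for an ungapped comparison")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences carry the same one-letter residue code.
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Final five residues of the overlap shown in the alignment above.
tail_217 = "EDDDY"
tail_516 = "EDDDY"
print(f"{percent_identity(tail_217, tail_516):.1f}% identity in {len(tail_217)} aa overlap")
```

Applied to the full overlapping segments rather than the five-residue tail, the same count divided by the overlap length gives the percentage reported in the figure heading.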
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CLUSTAL W(1.5) multiple sequence alignment 
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LVFKKTIVLHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTC 



GNKSSVNSTVLVKNTKKTNP 
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99.6% identity in 225 aa overlap 

10 20 30 40 50 60 

SEQ ID NO: 515 PTAVQKEEARQDVEALLSRTVRTQILTGKELRVATQEKEGSSGRCMLTLLGLSFILAGLI 

SEQ ID NO: 231 LRVATQEKEGSSGRCMLTLLGLSFILAGLI 

10 20 30 

70 80 90 100 110 120 

SEQ ID NO: 515 VGGACIYKYFMPKSTIYRGEMCFFDSEDPANSLRGGEPNFLPVTEEADIREDDNIAIIDV 



SEQ ID NO: 231 VGGACIYKYFMPKSTIYRGEMCFFDSEDPANSLRGGEPNFLPVTEEADIREDDNIAIIDV 

40 50 60 70 80 90 

130 140 150 160 170 180 

SEQ ID NO: 515 PVPSFSDSDPAAIIHDFEKGMTAYLDLLLGNCYLMPLNTSIVMPPKNLVELFGKLASGRY 



SEQ ID NO: 231 PVPSFSDSDPAAIIHDFEKGMTAYLDLLLGICYLMPLNTSIVMPPKNLVELFGKLASGRY 

100 110 120 130 140 150 

190 200 210 220 230 240 

SEQ ID NO: 515 LPQTYVVREDLVAVEEIRDVSNLGIFIYQLCNNRKSFRLRRRDLLLGFNKRAIDKCWKIR 



SEQ ID NO: 231 LPQTYVVREDLVAVEEIRDVSNLGIFIYQLCNNRKSFRLRRRDLLLGFNKRAIDKCWKIR 
160 170 180 190 200 210 

250 260 

SEQ ID NO: 515 HFPNEFIVETKICQE 



SEQ ID NO: 231 HFPNEFIVETKICQE 

220 
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99.7% identity in 353 aa overlap 

10 20 30 

SEQ ID NO: 196 MERGLKSADPRDGTGYTGWAGIAVLYLHLY 



SEQ ID NO: 518 LAEGYFDAAGRLTPEFSQRLTNKIRELLQQMERGLKSADPRDGTGYTGWAGIAVLYLHLY 
20 30 40 50 60 70 

40 50 60 70 80 90 

SEQ ID NO: 196 DVFGDPAYLQLAHGYVKQSLNCLTKRSITFLCGDAGPLAVAAVLYHKMNNEKQAEDCITR 



SEQ ID NO: 518 DVFGDPAYLQLAHGYVKQSLNCLTKRSITFLCGDAGPLAVAAVLYHKMNNEKQAEDCITR 
80 90 100 110 120 130 

100 110 120 130 140 150 

SEQ ID NO: 196 LIHLNKIDPHAPNEMLYGRIGYIYALLFVNKNFGVEKTPQSHIQQICETILTSGENLARK 



SEQ ID NO: 518 LIHLNKIDPHAPNEMLYGRIGYIYALLFVNKNFGVEKIPQSHIQQICETILTSGENLARK 
140 150 160 170 180 190 

160 170 180 190 200 210 

SEQ ID NO: 196 RNFTAKSPLMYEWYQEYYVGAAHGLAGIYYYLMQPSLQVSQGKLHSLVKPSVDYVCQLKF 

SEQ ID NO: 518 RNFTAKSPLMYEWYQEYYVGAAHGLAGIYYYLMQPSLQVSQGKLHSLVKPSVDYVCQLKF 
200 210 220 230 240 250 

220 230 240 250 260 270 

SEQ ID NO: 196 PSGNYPPCIGDNRDLLVHWCHGAPGVIYMLIQAYKVFREEKYLCDAYQCADVIWQYGLLK 



SEQ ID NO: 518 PSGNYPPCIGDNRDLLVHWCHGAPGVIYMLIQAYKVFREEKYLCDAYQCADVIWQYGLLK 
260 270 280 290 300 310 

280 290 300 310 320 330 

SEQ ID NO: 196 KGYGLCHGSAGNAYAFLTLYNLTQDMKYLYRACKFAEWCLEYGEHGCRTPDTPFSLFEGM 

SEQ ID NO: 518 KGYGLCHGSAGNAYAFLTLYNLTQDMKYLYRACKFAEWCLEYGEHGCRTPDTPFSLFEGM 
320 330 340 350 360 370 

340 350 
SEQ ID NO: 196 AGTIYFLADLLVPTKARFPAFEL 



SEQ ID NO: 518 AGTIYFLADLLVPTKARFPAFEL 
380 390 
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98.5% identity in 194 aa overlap 

90 100 110 120 130 140 

SEQ ID NO: 519 ARNLPPLTDAQKNKLRHLSVVTLAAKVKCIPYAVLLEALALRNVRQLEDLVIEAVYADVL 



SEQ ID NO: 158 ARNLPPLTEAQKNKLRHLSVVTLAAKVKCIPYAVLLEALALRNVRQLEDLVIEAVYADVL 
60 70 80 90 100 110 

150 160 170 180 190 200 

SEQ ID NO: 519 RGSLDQRNQRLEVDYSIGRDIQRQDLSAIAQTLQEWCVGCEVVLSGIEEQVSRANQHKEQ 



SEQ ID NO: 158 RGSLDQRNQRLEVDYSIGRDIQRQDLSAIARTLQEWCVGCEVVLSGIEEQVSRANQHKEQ 
120 130 140 150 160 170 

210 220 230 240 250 260 

SEQ ID NO: 519 QLGLKQQIESEVANLKKTIKVTTAAAAAATSQDPEQHLTELREPASGTNQRQPSKKASKG 



SEQ ID NO: 158 QLGLKQQIESEVANLKKTIKVTTAAAAAATSQDPEQHLTELREPAPGTNQRQPSKKASKG 
180 190 200 210 220 230 

270 

SEQ ID NO: 519 KGLRGSAKIWSKSN 



SEQ ID NO: 158 KGLRGSAKIWSKSN 
240 250 



88.7% identity in 62 aa overlap 

10 20 30 40 50 60 

SEQ ID NO: 519 MSAEVKVTGQNQEQFLLLAKSAKGAALATLIHQVLEAPGVYVFGELLDMPNVRELAESDF 



SEQ ID NO: 158 MSAEVKVTGQNQEQFLLLAKSAKGAALATLIHQVLEAPGVYVFGELLDMPNVRELXARNL 

10 20 30 40 50 60 



SEQ ID NO: 519 AS 
.X 

SEQ ID NO: 158 PP 



FIGURE 14 






68.9% identity in 74 aa overlap 

10 20 30 40 50 

SEQ ID NO: 226 MIARRNPVPLRFLPDEARSLPPPKLTDPRLLYIGFLGYCSGLIDNLIRRRPIATAGLHR 



SEQ ID NO: 514 MMTGRQGRATFQFLPDEARSLPPPKLTDPRLAFVGFLGYCSGLIDNAIRRRPVLLAGLHR 

10 20 30 40 50 60 

60 70 
SEQ ID NO: 226 QLLYITAFFLLDIIL 

SEQ ID NO: 514 QLLYITSFVFVGYYLLKRQDYMYAVRDHDMFSYIKSHPEDFPEKDKKTYGEVFEEFHPVR 

70 80 90 100 110 120 
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